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TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT TAGGATCAAA 12 6 0 

GTCATATAGA GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTGT TATTGTATAC 13 2 0 

5 GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 13 8 0 

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 14 4 0 

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 150 0 

W 

CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 156 0 

GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 162 0 

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 16 8 0 

IS 

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 174 0 

TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 1800 

zo TTTTTTAGCA ACTAAACCTG cACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 186 0 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CAT CTTTAAA 192 0 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 198 0 

25 TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 204 0 

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 210 0 

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT T CAAATATTC 216 0 

30 TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 2220 

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTG CAATTGT 2280 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 234 0 

35 

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 24 00 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 2460 

AGGTTGTCCA AGCATTCCAG CTTCAGCTTC AATACCACCA ACACCCCATC CTAGTACGCC 2520 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 258 0 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATT CTA AGTTAACTTG 264 0 

45 GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 270 0 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 2760 

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2820 

50 TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 28 80 

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 294 0 

55 
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GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 306 0 

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 312 0 

AG TATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTG CAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3 24 0 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3 3 00 

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTG TAATAAG TATTTTTTAT 3 3 60 

TAATTGAGAA TAATTATCAA TTTCG CGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 34 20 

CATTGAGCAG AAAGAAAATT ATGGCAC CAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 34 80 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3 540 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3 600 

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3 6 60 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAGAATTGAA 3 720 

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780 

CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3 84 0 

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGT CAAACC CCTTTCTATT AAGTTTTCAG 3 900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGTTTAATA ACGAAATTCT 3 960 

CGTTCTTATC CTTTTGAATA TACT CAATTT TCCACAAAAA CAAACAAGTA GTATATCTGT 4 020 

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4 0 80 

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATT CATAATC 4140 

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200 

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACCATCATGA GTATGACGAC 4 260 

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4 320 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGC CTGTG AAATTCCCCA 4 3 80 

AACAACTTTT ACCATACTAG ATGGATTTAA TGAAC CAAAT GTTGTTTGCA TTC CTAATAC 4 44 0 

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATG CAATCAA 450 0 

CGATAAAACA ATGCCAAATG G CACATG ATT AAACACTCCA AATAGCTGTG TTT CAGGAGT 4 560 

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4620 

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACT CTCTAAT 4 6 80 

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4 740 
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TGCTGTATCA AAACTATTAA ACAAGAATGT GTTTAGTAAA CTACCCGTAG AGCTAGTTAA 4 86 0 

CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA GCTACCATTA AAATAGTACC 4 92 0 

5 TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA CTTAATCCAG ACCATGCACT 4 98 0 

TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT TGTACAAACG TATTGTTTGG 504 0 

AACATTGAAC AAGTAATGTA AACCACCATT AATTTGTAGA GCACCCATAC CTAACGAAAC 5100 

w 

GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA ATAAAAATCC CAATAGGACC 516 0 

TTCTACTTTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT AATAAACCTG GTTCACCTTT 5220 

ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA AGCATGGAAT 5280 

15 

CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG CTTCAGTAGT TTTGGGATCT 534 0 

GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG CACCATAAAA CACCAAACCT 54 00 

zQ ATCCCCATAC CAGCACTAAA CAACATAGCA AACCATGAAA TTGTATTAAA CTCAGGTTTG 546 0 

TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA AAATAAGGAA TATACAGAAG 5520 

AACACGATAA TCGTAGTAAG AATAAGATAA TACCAACCTA ACTTTTCTGT AATCCACATT 5 580 

25 TTAATATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA ATGCAC CAAG TAATACGACT 5 640 

ATAG CAACAA CAATTGCACT ATAGATGAAG ACTGGTGAAT ACTTCTTTCC ATTTGGATTC 5700 

TCTGGTGAAG AAGAATTCAT AATTAATTAC TCCCTTCAAT TCTATATTTA ATTTTATGTA 5760 

30 GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT CACG 5 804 
(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
r (D) TOPOLOGY: linear 

40 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

ACCCGCGAAT ATGGTCCATC CTATCGATTT ATTTTTAACT GGTTTGACAA TATTTAATTT 6 0 

TTCATAATCA TTCTTAGTGA TTTTGACATA TGTTTTCGGT ATGAGCCAGT TAATAAATGG 120 

45 

AAAGAAGAAG ACAATCCAAT TACTTGC CAA ATCaATCATT AAATATTCAC TATCGTATTT 180 

GATTATTCGA TATTTAGGGT TTTTATTAAT AACTTTAGAT TCGCAAAGCA ATGTCTCCAC 240 

50 ATCCCTTTAA TTTTATGTGT AATACATTTT TCGATACTTC AAAAGACATT CAAATACTAT 3 00 

CAAGTTACTG TCATCAAAGG TTTTATTAAC TGATATTtTC ATATTTTTAa TCTGAATTTA 360 
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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAGAT GCTGATATTA ATTCTATGAA 6 0 

AACAGCAGTT TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 120 

15 

ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGG CGGTAT 180 

CAACGAAAAT ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 24 0 

20 ACATGTACTA CAAAATAACT TTGATGATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 3 00 

TACTCATAGT GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 360 

AGATAATAAA GAAATTGAAT TGCCATAATA ATGAAAAAGA TAryACTTTA CGGCGGTCAG 420 

25 TTTAACCCTA TCCATACTGC ACATATGATA GTAGCTAGCG AAGTATTTCA TGAATTACAG 4 60 

CCAGATGAAT TTTATTTTTT ACCTAGTTTT ATGTCTCCAT TGAAAAAGCA CCATGATTTT 54 0 

ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 600 

30 GATATTTGTG ACGATGAAAT TAAACGTGGT GGTCAAAGTT ATACCTATGA CACGATCAAG 66 0 

GCATTCAAGG AGCAACACAA AGA CAGTG AG TTGTACTTTG TTATTGGGAC GGATCAGTAT 7 20 

AACCAACTAG AGAAATGGTA TCAAATTGAA TACTTAAAAG AAATGG TTAC TTTTGTAGTT 780 

35 GTAAATC GAG ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTGCAAT TCAGATACCT 84 0 

AGGQTAGATA TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900 

GTTCTTGTTC CTAAATCCGT T G AAAACT AT ATTAAGGGGG AAGGATTATA TGAACATTGA 96 0 

40 

AAAA 964 

(2) INFORMATION FOR SEQ ID NO: 252: 

( i ) S EQUENCE CHARACTER I ST I CS : 
45 (A) LENGTH: 1193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 
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TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA 


ACAGGAGCGT 






ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATGGCTG 


ATGATGAATT 




5 


TAAAGTATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTG CAT 


ATTGCGCCAT 


240 




GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA 


TCAAAAGTAT 




to 




CTAAATTTAA 




AATAGTTATT TTTGTAATTC TAAAC CTTTT TCATCGCGAA AACAATTAAA 


TAGGTCGCGG 






TATTAATTAT TATATTATTA CCGCTTAATA TGAAAAATAC ATGAaAATTA 






15 




AAGGATTCTA 






GTTTCAAAAG GG CTGATAAG CATAAAATGA AATGTAATAT TTCGATGTAT 








ATT AG CT AAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC 






20 


ACGACA CAT A TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG 


ATAAATCCTT 






CGTCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC 








TATCCAAATG ATAAACCTCG CTAGTTAAAT AATCTTCAAT TGCTTTCACT 


















CAGTTTGATA 






AATCTGTTCT AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG 


AAGCGTATTT 


1020 


- 


TATAATGAGT GGCACAATTT GTACGGCATC TTTATCACGT ACAAAAGGCT 


CTGATAGAAA 


1080 




ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG 


TCTTATTTC C 


1140 




TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT 


ATA 


1193 


35 
40 


(2) INFORMATION FOR SEQ ID NO: 253: 

£1 ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 base pairs 

(C) STRANDEDNESS : double 






45 


(Xt) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 








TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT 


TTATTCTCCT 


60 




AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA 


AnGAATATAT 


120 


50 


nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT 


GCATAAAGAT 


180 




GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA 


TGGGGAACAT 


240 
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AAAAAAATCT TCGAACATTC AGGACATGCA CCGCATATTG AAGAACCAGA AGCATTTATG 3 60 

AATTATTATT TAAAATTTTT AAAAAGCGTA T CAT AATATG TGATATATAA ACCTAGGGCA 4 20 

5 TAAAGTCCTT AGGCAATGTG AAAAAGCTGA TTACTATTCA TTATTTGATA GAAATCAGCT 4 BO 

TTTTTTGAAA TGTATTTGAT ATATACTGCT CGTTATGCGG CTATCTTCCT TATATTAAGT 54 0 

GCCATTAGTG CAAAACCTCT TAACAATTAG GTAAAAAGAG CATAAAAAAA GGAAGTTTAA 6 00 

to 

TAGAATGTAT CAT CTAT CAA ACTTCACCAA ATTGCGCTAA ACAAAATTAT AGTTCAATTT 66 0 

CGTTGTTTGC TTCAGTGATT CGTTTATTTA CTCGA CTCAA TAATGATTCG ATTTTTTTAC 72 0 

GTTGTTGTGC ATTAACAAGA ATTAATACAG TTCTTTCATC ATGCTCATTA CGTTTTTTAT 78 0 

15 

CGAAGTAATC TTCTTGAGAT AAAATTTTAA CTGCTTTAAC AACTTGTGGT TGTTTGTAGT 84 0 

TTAAATGATT AATAATATCt TTAAGATAGT ATTCTTTCTC TTTGTTTTCG CTGATGTATG 900 

20 TCAATACAGC GAATTCTTCA AAGCTAATTG AnAATTC CTT TTTAATTAAA CTTTTTAATT 96 0 

TGTCAGCATA AGTGACCATT GATAACAACT CAAAGCAATC ATTGATTTTT GTAATTGCCA 102 0 

TGTTTAAAAC CTCCCTATTT GATGCATCTT GCTCGATACA TTTGCCCCGA TAA TATAnTG 108 0 

25 TATCTAATCT TTATGnAT 109 8 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

CCAGGTAAAA TTGTGCAATC ATTTGACGCA TTAATGGACG CCTTGGACAA TGAAGATTAT 6 0 

GAAGGAGAAA AAGTCATTCC ATTCTTAGAT AAACATTTTA AATATCAAGA TGGCCGATCA 12 0 

40 

AGTGAGCGTT TAGTCAGAAA TTTATTTGGT AGCTAAGTTT ATATAGTAGT CAAAGTGGGA 180 

GAGGTATAAT GATGAAATTT TCAGTAATAG TTCCAACATr CAATTCAGAA AAGTATATAA 24 0 

CAGAATTACT TAATAGCCTT GCGAAACAAG ATTTTCCGAA AACTGAATTT GAAGTGGTTG 300 

TAGTTGATGA CTGTTCAACA GATCAAACGT TACAAATAGT TGAAAAGTAT CGCAATAAAT 360 

TGAACTT GAA AGTAAGTCAA CTCGAAACAA ATTCTGGTGG TCCAGGTAAA CCTAGAAATG 420 

SO kGGCGTTAAA ACAAGCAGAA GGTGAATTTG TATTATTTGT GGACTCCGAT GACTATATAA 480 

ACAAAGAGAC TTTAAAG GAT GCAGCAGCAT TTATTGATGA ACATCACTCA GATGTCTTAT 54 0 



£5 
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CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 66 0 

ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTT C C AGAAGAATTA AAGAGTGCAG 720 

AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 78 0 

ATAAAGCGTA TTATTATGCT ACAAAG CGTG AAGGTGAACA TATGAGTAGT GCGTATGTTT 84 0 

CACCTGAAGA CTTTTATGAA GTCATGAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 900 

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 960 

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 102 0 

CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 10 80 

TACGACCATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 1140 

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATGGTAAA TTAAACATTC 1200 

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 12 60 

TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 13 20 

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 13 8 0 

CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 144 0 

AAGATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 15 00 

TAGGTACGGA AGTATTTGAA GTGCGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 15 60 

CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 162 0 

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 16 80 

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 174 0 

TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 180 0 

AAACfiTCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 1860 

AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 192 0 

CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATGA 1980 

TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATGAG 2 040 

GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 2100 

GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAGTAAG 2160 

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCG TCT 2220 

CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 228 0 

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 234 0 
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TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 24 6 0 

AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGACGATAT 2 52 0 

5 TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 2 580 

AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 264 0 

TGTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 2700 

TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 2760 

GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 2 82 0 

AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 2 8 80 

IS 

A 2881 

(2) INFORMATION FOR SEQ ID NO: 2 55: 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 1056 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTGTATTAA GATGCTCAAC TAATTTGGGT 60 

30 ACATATTCCG AATTTAGATT TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT 12 0 

AAAATAAATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA ACACTAAATG TTCATCATTA 18 0 

TTGGTAAACC AGCCGGAAGC ATATGCATTG GCATTAGGTT CACCAATTGT TGAAGATAAA 24 0 

35 TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 300 

ATCGAATGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 36 0 

TTAAATTCAG GTTTAGAGAC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 42 0 

40 

TGTTTTGATT TGTAATTGGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 4 80 

CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 54 0 

AATAAAT CAT AGTTCATATT TGAGTATTCA AATTCTTCTC CGGGCTTATG ATGTAATTCA 60 0 

45 

TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT CACAGAATCT 6 60 

tCGCTTGTAA TAtCACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCGTAATA 72 0 

so GTTTCATTTT GACCATTATA GTTCATTTTA AAATGAGGCA CATGTTTGGA TACGGCATCA 780 

TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 84 0 

55 
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TGATAAC CAT AAC CTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 96 0 

CCAGGAATGT GTCCCTTTTG TAAAT CATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 102 0 

T CATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 10 56 



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 
w (A) LENGTH : 1277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 6 0 

ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAA CTTAT GTTAATCGAT 120 

20 

CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCC GGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 24 0 

2S TATGATTTAT TCCAACATTC ATCAACTAGA AACATTAAAG GTTATAACGA ATTAATCCGT 3 00 

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 3 60 

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 420 

30 ATTACACAAA TGGCACGTGC AG CAGG TATA CATTTAATTG TAGCGACACA AAGACCTTCT 4 80 

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 540 

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 6 00 

35 AAAGG TGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 660 

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780 

40 GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 840 

TTGTTACAAC GCCAATTTaG AATTGG t TAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900 

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960 

45 

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGAC CAAC TTC CAAGTAA 1080 

5Q TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGcA 114 0 

TTGATTACTG AACAAGTCAT TAAAGATAtT TTGAAGAGGG CACAAGTGTT AAGTCACTGC 1200 
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GrtTTGAATG CGGAACT 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 6 0 

TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACG CTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 24 0 

TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 3 60 

TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 4 20 

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCT CA AAAATACAAC 4 80 

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 54 0 

CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 60 0 

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 660 

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720 

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780 

ACAC23TAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 84 0 

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 90 0 

TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTAC CAAGT AAACAACACA 96 0 

CCATCAGTTG GTT CAATT AT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 102 0 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 10 80 

TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 114 0 

TAATTAAATA AATTGTACTG ATATATACTA G CAATT CACA TCATGTGAGA TTGCTAGTTT 120 0 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 126 0 

ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 132 0 
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ATTTAATTTG TTATACCAGT ATTTTACGCT TTTTCGTCTA CATATA CAAA TTTATATTAA 144 0 

ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 150 0 

5 ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 156 0 

CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyTTGTCTT 1620 

GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 168 0 

10 

ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 1740 

AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 1800 

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAG CA 1860 

15 

CCAACACCTA AGATTGATGA TGACACAACG TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920 

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 1980 

2Q ATTTT CAT AA TATTACCACC TACAGTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2 04 0 

AGCCC CATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTT CAAC ACTGCCATCA 210 0 

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 2160 

25 TTCGTACCGT GAGAGAATGA TTGTAACGCT GCTGTGAAAA TTTGGAAAAA TCTAAAGTTA 2220 

CGATT CGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTGA ATACATCAAG 22 8 0 

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 234 0 

30 CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 2400 

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 246 0 

ATAATAGCCG CAAGTATTGC AG CTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520 

35 TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 25 80 

TTCATCACTG CTGCCATTAA AATTGCCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 264 0 

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAG CGAGAAA 2700 

40 

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 2 760 

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTTCT 282 0 

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGAT CTGT TTCGCTATTG 2 88 0 

45 

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2 94 0 

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3 000 

so TTCATTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 306 0 

TATTCGATTG AATACATTTC AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 3120 
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TTTAAATCAG TAATTACTTG ATGTACTAAt t CGcACCATG TGACTCATAA GTTTTAATGT 3 24 0 

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 33 0 0 

5 CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 336 0 

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 34 2 0 

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 34 8 0 

10 GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 3 54 0 

CTGTATTTGC CCTTATA 3557 



(2) INFORMATION FOR SEQ ID NO: 258: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

2s AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 6 0 

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 12 0 

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGG CATG ACGCCAATAG 180 

30 CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATG CCA ATTGACCCTG 24 0 

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300 
GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360 

35 TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 4 20 

TCTTGCTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480 
TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 54 0 

40 TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600 
ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660 
T CATC AA CAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720 

45 

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 780 
A CAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 84 0 

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900 

50 

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960 

55 
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CGCC TTTTT T CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 10 80 

ATT T TTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 1140 

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200 

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260 

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 13 2 0 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTG CTGCAGA TGTTTTTGAA GCAACTTTAC 13 8 0 

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 14 4 0 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500 

CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 156 0 

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 162 0 

AATTAnAAAA A 1631 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) topology -• linear 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

CGAAATCATG ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 6 0 

ATTC TTCAAC TGTTGCTAGA TATTG TGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 12 0 

35 CaTTGTGkTC mAAATATTTA AATTTTCCaG GtAATCTTAT AAGTCTTTCC aTATCTGATA 180 

ATCTtTTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 24 0 

TTATACTTTC TTTTTACATA TCAATCAATT AAATAT CATT TaAATATCTT CTTTaTATAA 300 

cTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGGAT 3 60 

AGTCTTGTCT TGATATATTA CTTAGTGGAT GTAATTATAT TTTCCTGGAT TTAAAATTGT 420 

TCTTGAAGAT TTAACATTAA ATCCAGCATA GTTCATTTCA GAAACAGTAA TTGTTC CATT 480 

45 

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAG CAGCTG TTTGGAAAAT 54 0 

AGCGCCAACT TCTGGTGTTT TATCTACTTT AAATCCTGCA ACTTTTGCTG CGTAATTCCA 6 00 

50 GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 6 60 

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTT AG CAA CTGATTTATT 720 



55 
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TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 84 0 

TGCTTGTTGT ACAGG TTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900 

5 TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 96 0 

AAAAATTAAA GGTGTTACAC CAGGATTGTA TGAATATAAT TGATTCAACG TTAAGTGATG 102 0 

CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTG ATTG 108 0 

10 

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 114 0 

ATGTTGTTGT GCATG TGCTG CATTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 12 0 0 

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 12 6 0 

15 

AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AATCAAATGG 13 2 0 

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTATTC ATACCATGTA 13 8 0 

2Q ATTGAGAACG TTTTCAATAA TTAATTCAAT AC CTTGAAAA TCGCCATAGG TAATATTACT 144 0 

AAATGCAGAC TGCATATGTT GTTTTAACAA ACACAACTTT TAAAAAATAT ATTCTAACTC 1500 

TATCTACCGA ATTGTACTTA AATATTCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 156 0 

25 ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 16 2 0 

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 16 8 0 

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 174 0 

30 CATTGAGTAT GGATGGAAAA CATATGATCT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 18 0 0 

TTCGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 186 0 

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 192 0 

35 ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCGTATT GCCGATTCAG TTTTGTTAAT 198 0 

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 204 0 

TTCAATCTCT ATTTCATCAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 2100 

40 

ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCGTACT AGTCGGTGTA CACCTTTTTC 216 0 

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 222 0 

CGCTTCATCC CCAGGTAGAT AATCAACAGT TTCAACTTTA AAGCCTTTCT TCTCACAATA 22 BO 

45 

ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 234 0 

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGTGAGGC CCATCTAATA AT AATTG CAA 24 0 0 

50 TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCT CTAAGT CTTCTTTCAT 24 60 

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2520 

ss 
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CCCAAAAATT 


AGGTTCTGCC 


ATCATTTCTT 


CATATTCTTG 


2640 




AATATTAGTT 




CTAAGTCAAA 


GAGACCCCCT 


AATTTGTGTT 


AAAT CTTGAT 


2700 


5 


TATACTTATC 


TATATTTCGT 


TTGATTTCTG 


ATAATTCCAT 


AGcATTCGCT 


CCTATTTATA 


2760 




TTTCAATTCA 


AGTCATTGAT 




TATAATGCTA 


AATTTTAACA 


TAATTTTGTT 


2820 


10 


AAATAACAAT 


GTTAAGAAAT 


ATAAGCACAC 


TGACAATTAG 


TTTATGCATT 


TATTGTTAAA 


2880 


AATCAGTACA 


TTTATCATCG 


ACAT ATG C CT 


AAACCGATTT 


TTTAAAACTA 


AGTACATAAC 


2940 




AACGTTTAAC AACTTCTTCA 




AGTATTTAAC GCTTGTAAAA 


TAAAAAGACT 


3000 


15 


CCTCCCATAA 


CACAAACTAT 




TTGGAAGGAG 


TTATTTTATA 


T CATTT ATTT 


3060 




TC CATGGCAA 


TTTTTGAATT 




ACCACATGGA 


CAATCATCGT 


TACGACCAAC 


3120 




TTGATCGCCT 


TTAACGATTG 




CACTTTTTCT 


TTACCATCTT 


CAGCTGAAAC 


3180 


20 


GTGCTTCGCT 


TCACCAAACT 




TTCACGTTCA 


ATATTATCTT 


CAACTTGTAC 


3240 




TACAGATTTT 


AAAATGAATT 


TACAAGTATC 


TTCTTCAATA 


TTTTGCATCA 


TGATATCAAA 


3300 




TAATTCATGA 


CCTTCATTTT 




TAATGGATTT 


TGTTGTGCAT 


AAGAACGTAA 


3360 




GTGAATACCT 


TGACGTAATT 




GTCGATATGA 


TGAGTC CAAT 


GGCTATCAAT 


3420 




AGAACGAaGT 


AAAATCATAC 


GCTCAAACTC 


ATTCATTTGT 


T CTTCTAAG A 


TATCTTTTTG 


3480 




ACTTTGATAT 


GCTGCTTCAA 


TCTTAGCCCA 


AACGACTTCG 


AAAATATCTT 


CAGCATCTTT 


3540 




ACCTTTGATA 


TCATCCTCTG 


TAATGTCACC 


TTCTTGTAAG 


AAGATGTCAT 


TAATGTAGTC 


3600 




GATGAATGGT 


TGATATTCAG 


GCTCGTCATC 


TGCTGTATTA ATATAGTAAT 


TGATACTACG 


3660 




TTGTAACGTT 


GAACGTAGCA 


TTGCATCTAC AACTTGAGAG CTGTCTTCTT 


CATCAATAAT 


3720 


35 


ACTATTTCTT 


TCGTTATAGA 


TAATTTCACG 


TTGTTTACGT 


AATACTTCAT 


CGTATTCTAA 


37B0 




GATACGTTTA cgcgcgtcga 


AGTTATTACC 


TTCTACACGT 


TTTTGTGCTG 


ATTCTACAGC 


3840 




TCTTGATACC 


ATTTTTGATT 


CAATTGGTGT 


AGAGTCATCT 


AAACCTAGTC 


GGCTCATCAT 


3900 


40 


TTTCTGTAAA 


CGTTCAGAAC 


CAAAACGAAT 


CATTAATTCA TCTTGTAATG 


ATAAATAGAA 


3960 




GCGACTATCC 


CCTTTATCAC 


CTTGACGTC C 


AGAACGACCA 


CGTAACTGGT 


CATCAATACG 


4020 


45 


ACGAGATTCA 


TGTCGCTCTG 


TACCTATTAC 


TGCTAAACCG 


CCTAATTCCT 


CTACGCCTTC 




ACCTAATTTG 


ATATCTGTAC 


CACGACCAGC 


CATGTTAGTG 


GCAATAGTAA 


CGGCACCTTT 


4140 




TTGTCCAGCG 


CCTGCAACAA 


TTTCAGCTTC 


ACGTTCATGA 


TTTTTCGCAT 


TTAACACATC 


4200 


SO 


ATGACGGATA 


CCACGTTTTT 


TAAGTAAATT 


TGAAATATAT 


TCAGAAGTCT 


CAACTGCAAC 


4260 




AGTACCTAAT 


AGCACTGGTT 


GCCCTGCCTT 


GTGTTTTTCA 


ACAACATCTT 


CTACTACTGC 


4320 
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TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4 44 0 

AGTTTTAGCT GTACCTGTCA TACCCGCAAG TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4 50 0 

5 TGTAATAGAC GCCATAGTTT TAGATTCATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4 56 0 

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTACGTCCTG TAAATTGATC 4 62 0 

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA ATGTAACGTG 4 68 0 

'° CGCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4 74 0 

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGTTAAAT GTACAGCTTT 4 80 0 

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4 860 

75 

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACGTGC 4 920 

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4 9 80 

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 504 0 

20 

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCG TCTTACTGTT 5100 

TAAGTTTAAT CCGACAGTCA AACCTAAGAA GTTATATAAC TCAG CCATTT CTTCACTTTG 516 0 

2S AACACTTGAT AAGTATTCAT TGACTGTAAT AACGTGAACA CCTCTACCAG CTAATGCATT 5220 

TAAGTATGTT GGCATTG T CG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCGAT 52 80 

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 534 0 

30 TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT GGTAAAATTT TATCTAAATA 54 00 

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTCGTTTT 54 60 

ATTACGAATT TCTTCATCAG TTAAAATTGC CGTTTTTTCT TCTAAAGCGA TTACTTTATC 5520 

35 AGCAAGTTTA CCTAACTGTT TAATTT CTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580 

TCCC&TTTCG TTCGCTCCTT TAGCTAAAAA ACTGTTTGGC CTACAACAAT ATATCTTATC 5640 

ATTTATAGTT AGAAAATTAT ACTTATTTAC TCATTTGTAG AATCAATATA AATATATTTA 5700 

40 TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATT CTCAA 5760 

TACGCAATTA TACTTCCTAA TAAATTATAT TATAAATATT TTACGATTTT CGACTCGGAC 5820 

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 58 80 

45 

TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATTAT CGAACTATGA AATAGTCATA 5 940 

ATTTGCTTTT GGAGTATAAA AAAGCACTTG TGCAAAAACA CAAGTGCTTT AAACTTAATT 6 000 

TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCGGTA AACGATACTT 60 60 

50 

GTTCCATCAG TTTCTCTGTC TGTGAATACA AAGAAGTCAT G AC CTAATAG ATT CATTTGT 6120 

55 
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ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGTGTTT CTTGCATTTC TTGTAATTCG 6 24 0 

GCAACAAACA CTT CTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTTATAT 63 00 

TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6 3 60 

TTT CGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 6420 

GCTGAATTTG AATAAGTTTT AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 64 80 

TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAG CATC TGTGATAGTG 6 54 0 

AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600 

GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 664 5 
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 3 0 baae pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 6 0 

AAACGGTGTT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 12 0 

AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 180 

TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 24 0 

TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 300 

TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 3 60 

TAACAATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 42 0 

AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 4 80 

TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 54 0 

TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600 

AGAACCAACA AAGGACAATG ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 66 0 

AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 72 0 

GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 7 80 

TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 840 

TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900 
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TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT ATGGAAAAAC 1020 

ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 108 0 

5 TATGGGCGGT CAAA CGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA ATCGTGAAGA 114 0 

AATAGAGTAT CAAAAGAAAC ATGGTGGcGA AATTTCTCCA CTATTCAAAG GTAATCATGA 120 0 

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC ACGCATCAGA 126 0 

'° TTTAGCTGGT AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 132 0 

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAAGCCAAA 13 BO 

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 144 0 

15 

TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATCGTA AAACGTCGTT 1500 

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG CATTAAATAG 156 0 

CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 1620 

20 

CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 16 80 

TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA AAGGCATTTG 174 0 

25 GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC AAGACAGTTC 1800 

TGATACAGTG CGCACAAGAG AAGAATTACA AGATTTTTGG CATCATTTAG CAGACGATTT 186 0 

AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA TTTATAAAGT AAAGGGAGGA 1920 

30 ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 1980 

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAAA GAAATTCTAC 2040 

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG CTTACAATAA 2100 

35 TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA CAATAATGTG 2160 

CAAGTTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCTGT CCCACTCCCA 2220 

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AGAATATATG TATAACAATA TTAAACACGC 2280 

40 GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 234 0 

TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA TAAACCTAAA 2400 

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA TGAATAGTAG 24 6 0 

45 

TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG CAAACAATGA 2 520 

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA TCATTTGAAT 2580 

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 2 64 0 

SO 

ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 2 700 
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TCTAAGAAAT TTAATACACG TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 2B20 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 28 8 0 

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2 94 0 

AATAATAATA TTTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACT CAAA 300 0 

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 306 0 

TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 312 0 

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 318 0 

GTGACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 324 0 

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300 

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 33 6 0 

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 3420 

TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATG TAATAAT CATTGGAGTT 34 8 0 

CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 3 54 0 

CGGTAATTTA TCGTCATT CA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3 600 

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 366 0 

AG CTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 372 0 

TTCAAATTCC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 37 8 0 

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 384 0 

TTTATATTTT TTTAAGATTG GATAAGCATT TT CAT AAAT A GTTTCATCCA TATCATCAAA 3900 

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 396 0 

CAAQGTTAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 402 0 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 408 0 

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 414 0 

TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 420 0 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 426 0 

TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 432 0 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 43 80 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 444 0 

AG TAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 450 0 
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TTGCGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGACAAGA ACTACTGCTG 4 62 0 

CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATATTCT 468 0 

TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 4 74 0 

TCATAGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4 8 00 

AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 4 86 0 

TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 4 92 0 

TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 4 9 80 

AAAGACCTCC CAATGTTTCT GGAACCAACA TCCAACACAT GGCAAGCGGT T CAT ACTTAA 504 0 

TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG ACACCCGAAA 516 0 

TAGTATTGAC TGCGCCAGCA AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAACTTGCAT 522 0 

ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 52 8 0 

TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 534 0 

GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 54 0 0 

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 54 6 0 

TATTTTCTTT GATTTTATAG ATGAGTTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 5 52 0 

TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 558 0 

TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTAATGCC TTCTAATTCA TCCACATTTA 564 0 

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTGGT AAAATAGAAA TAAATTGAAC 57 0 0 

CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 57 6 0 

ATTfCTTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 58 2 0 

AATTTGCAAT AGATTGTTGT TATAATTAAA CGGAAATATT TGTAATTGCA ACT T AATTTT 58 80 

CCTGTAACAT AGTGTGATTA ATTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 5 94 0 

TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATGA CGGTACAACA CTTGATGATA 6 00 0 

TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCGAAAAAAA 606 0 

GTATTTACGA ACAAAGTGTT AAATGTTGTT TTGATTACCT TAATAATATT ATTATGATGA 612 0 

ATCAAAATAA ATCGAACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 6180 

TCGACATCGA AGAAAGGTAT ATTAGAATGT ACGTTCAATT ATCTAATACG CCTGAGGAAT 624 0 

TTTCTGGAAA TATTTACGGA CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 6300 
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TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 64 20 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 64 8 0 

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 6 54 0 

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAAT CGA TATTGAGAAT ATTAAGATGT 6 6 00 

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 66 60 

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 6720 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 678 0 

GAAATCCGCA TAAATTAGAT GAAATGGAAA G TAATAAGT A ATAATTTATT GATAAGCGCC 684 0 

TATG TGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 690 0 

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 69 60 

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 7 02 0 

AGAG CAAATA ATCAAGTAAT GATGATGCTT CTA CTCGATT ATAAGAAAGC CATGATAGAG 7080 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 714 0 

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 720 0 

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 726 0 

TTAAA CT AG A TATAATTGTC CGATTTATAA AACATACATA AT G AATACaA TGATTGATTA 7320 

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 7380 

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 74 30 

(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 082 base pairs 
r (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

ATTGTTACTC ATTATTTTTT CAT ATT CACA CAAATGATCT TGTTTATATT TAGCTAATTG 60 

ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTT CTA 12 0 

TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 18 0 

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 24 0 

TTGAATCGGT GTCTT CA CAT TAAACGGTAA gGAJcnAAAAT AATTTCACGC CTTTTGGTGT 3 00 
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AGGCGTCTGA ATAATTTGCA ATAAAAATGC TATGATTGCG ATAAATAATA TTGAATGCGT 4 20 

AAAGGTTCGA TGTCCAAAGA TCAATCTCAC AAAAAAACTA ATTACCTTAA ACCTTCTGCC 4 80 

5 AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 54 0 

AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 60 0 

CATGCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT ACTCACATTT 66 0 

10 

TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 72 0 

AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 780 

J5 ACAAAATATT TAAAGGATGT GTTTGAATAA TGG CAATGAC AGTAAAAAAG GATAATAATG 84 0 

AAGTGCGTAT TCAATGGAGA GTTG CTG AT A TCAAAATTCC TACAAGTGAA ATTAAAAATA 90 0 

TTACACAAGA CCAAGATATT CATGCAGTTC CTAAATTAGA CAGCAAAGAT GTATCTAGAA 960 

20 TCGGCTCAAC GTTTGGTAAA ACGAATCGCG TTATTATCGA TACTGAAGAC CACGAATACA 102 0 

TTATTTATAC TCAAAATGAT CAAAAGGTTT ACAATGAATT AACTAAATAA ATTGTATAAA 10 6 0 

AaAATCATTC ATGGTGAGGG CTTCAT GAAT GATTTTTTTA ATTGATTCAA CACCCAGCAT 114 0 

25 AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 1200 

TTTATAACAA AAGATGTGCC ATCAAAGAAA TAATTGGTAG TGTAATGATT GTTCTAATCA 126 0 

AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACCAAGT ATGACGCCAC 132 0 

30 

CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 1380 

TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 144 0 

35 TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT TAACTCTAAA AATGGAACAA 15 00 

AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 1560 

CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT TTTAATCCTG 1620 

io ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACC T AC TTCTGTTGCA GTATCAAAGC 1680 

CATGTCTCAA TGCCGTCTTT CCTT CTGGGA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 174 0 

CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTG 1800 

45 

CAACGAGGCA GGATACTATC ACTGATAAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 18 60 

CTGTTTCAGC AACTACAATT GCAAAGGTGA TAGAAACTAC ACTAAATGTC GTTGAAATAA 1920 

CTGTTGCCTC ACGACGAGAA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 1980 

50 

CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTCGAACGT CCTGGCAATG 2040 

TAAATAAAGG TCTCATAACC GGTCTAAAAA TAGGACC CAA CATCTCTAAC AAACCGTATT 2100 

55 
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AACTTGAGAA CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA 22 2 0 

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA C CAATTCTAA 22B0 

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 234 0 

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 24 0 0 

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 24 60 

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 252 0 

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2 58 0 

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 264 0 

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTACAGTT 2700 

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760 

TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820 

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2 8 80 

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2 94 0 

ATTAACC CCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3000 

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3 0 60 

AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120 

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180 

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 3 24 0 

AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3300 

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360 

CCTTIAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 3420 

AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 34 8 0 

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3 54 0 

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3 600 

AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3660 

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3720 

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 37 80 

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC AC CTACGAAT GATGTAACAA 3840 

TCCCTACTGG TGATT CATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3 900 
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AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 4 020 

CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 4 08 0 

IT 4082 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 2: 

TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 60 

TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA GAAATGAAAC 120 

CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC ATAAGATGCA 180 

TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTTCTAT CTGaGTATGA TAGAAATCGC 240 

CTTGTTTATT TTTAATCTTC ATCTAAAAAG TCTTTAATAG CTTGTTTATT TGTGTTTTTA 3 00 

TTAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTTCTTGATT 360 

GGCaCAGTCA ATGACTTAAC ATCTTTTTCA CCT CGGATAC CaAAACTCAA ACCTGTTTGG 4 20 

AAAATCCCTG AAT CAGGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 4 80 

GGTAATTTAA CAACTGTTCT AAAATTAACC ATTTCTTTTT TCAATGTTTG CATCACTTGT 54 0 

TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT TGCATAACCA 600 

AGTAATTCTT TACCATTCAA CCTATGGTTA CCCTTTTTCA AAGATACACC AATATTTTTC 66 0 

GACATATCTT TTTCGACATT AATTGGTACA CCTTCAGGCA TTAATTCATC AATCATTTTC 720 

TCAAATC CAG TAAAATCAAC TACTGCATAA TATTCAGGAT TAATTCCTAA ATTTTTATCA 780 

AGTGTTTTTC TAAGTAGCTC TGGACCACCT AAAG CGTATG CTGAATTAAT TTTGTGTTTT 84 0 

CCATATCCTG GAATAT CTGC ATAAATATCA CGCATGACAG ACATCATTTT CATCTTTTTA 900 

TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC AC CTTGTGCT 9 60 

TTATCTGCAC CGAGTACAAG AATAGAAATT nTACCATCAT TTTTTACTGG TCCATTAAAT 1020 

TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACCGCT ATTGTAACTA 1080 

TGTACAATAT ATACAACTAA TGCCGATAAG TAAAATTACA ACAATCAGAA GAATGATAGG 114 0 

TAATT 114 $ 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

TATGGCTCAT CATTAATGCA CGTATCGGGT AGCGTTTACC ATTGATAAGT GCTTCATGTT GO 

TAG CACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 120 

TGTT T T T AGT CACTTCAAAG GCAAAGCTAT AAC CGCAATA ATCACCAGCT TGTACTTCGC 180 

GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 24 0 

AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 300 

GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AG CATGGATT AGGTCGAACT 3 60 

GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 4 20 

AAtCTGACAC AT CGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 4 80 

TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 54 0 

TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 600 

CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 660 

GGACTAAATC AAACTCGTAA AC TG CATT CA TTAAAAAGAT TGTTGCATCT GGAGCAAGTT 7 20 

GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 7 80 

TAAACTGAGT TACAG CAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA ACAACTGCCA 840 

TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 900 

ACCATGTTGC TGTCAATGTA TTACAC CTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 960 

TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 1020 

ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG AGCTAGTTGT 10 8 0 

TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC ATAGACATTT 114 0 

AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 1200 

GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 12 60 

TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC ATATG TACGA 132 0 

ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC AGCTTTAAAA 138 0 

TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTGTGAT 144 0 
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ACATGACTTG ACAGGCCTGT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 15 60 

TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT AC CAAATAC C 162 0 

GCTTCAATTG GATACTTATC AAAGGCACCG GCTTTTATTA ATCGATTTGC ACCGCCACCA 16 80 

GTTTCTTCTG CAGGTTGGAA AATGAAAACG ACATTTTGCG GTAATTGACC TGCATCTTGC 174 0 

ATGTCTTTGC AACGTTGTAC AAAAAGCATT AATG CAGTTG TATGACCATC ATGTCCACAA lSOO 

GCATGCATCA CATGATCAGA TTGACTG CGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 18 60 

AACGCATCAA TATCAGCTCT ATACG CTATC GTATGTGAGC CATTACCTT C TAAGTATGCA 192 0 

ATGACGCCAG TTTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 19 8 0 

ATATAAGCAG TTGTTTCAAA TTCATGTAAG CTTAATTCAG GATGTTGATG TAAATGACGG 2 04 0 

CGATGTTTCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 210 0 

ATATAAATAG TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 2160 

TAAGGTTGAG ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 2 22 0 

ATTCAGTTTT CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 2280 

CTTAATCACT TTTG CAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 234 0 

AATCGCGCCA GCTGcAACAA TAGCACCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 24 0 0 

ATTTGCACCG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGTTCAAT 2460 

CACACCTGCT AATACTGCGC CAGCCCcTAC ATGTACATTT TT AC CAGTTG TAGCACGACC 2 520 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2 580 

TGTTGCGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 2 64 0 

ACCTGGTTCA ATTCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 2700 

ACGATCCATT TCAATTTCTA TATCTT CGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 2760 

CCAATCATCC GCTTCACAAA AGATTACTTT AGATTGTTCT GAACCAAATA CTTTAAAACT 2 820 

TTCTGGATAT GTGATGCCTT CAAAATTAC C ATTTAAATAT ACTTTTATTG GTGTAGACTT 2 880 

TTTAGCATCA CTTATATATT GAATAATTTC TTCAGCTGTT AAATGTTGTA CCATAAAATA 2 94 0 

ATCGATCTCC TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 3 000 

ACTAAGCGTT CTGCTGCTTG TATTGCACCA TTCGCAAAAA TAT CT T T T G A TTGTGCACGA 30 60 

TGCGTGATTT GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 3120 

GTACCTCCAC GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 3180 

TGTCTATCAT ACACAGGTGT TACATTTTCT TT CAAAG ATA CGATCACATC ATACAATTTT 324 0 
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TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 33 60 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 34 2 0 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTT CAT CT 34 8 0 

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAAT CTATTG CAACATCGGC ACCTTTAACA 3 54 0 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 36 0 0 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 37 2 0 

GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 378 0 

CAAATCCTAA ATAACTTGTT AG AG CTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3 84 0 

CTGATAACAG TGTGC CGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3900 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3 960 

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4 020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 408 0 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AG TTTCTGG T TCAATTGTCA 414 0 

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4200 

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4 260 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4 32 0 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 43 80 

TTAGAATGCG TTCTT TT TCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 444 0 

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4 500 

TATTAACTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC SCCAACACCC TCAAATAAAT 4 560 

GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CCAATACTTG TACAGCATTT 462 0 

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTT CTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 474 0 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4 800 

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4860 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4 920 

AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4 98 0 

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 504 0 
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TGATATGTTG TATATGCCAC TCGTTTTAAA C CATAAGCAT CTTGCAATAC TTTTAGAGGT 5160 

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 52 20 

5 

TCATTGACTT CCGGAACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 528 0 

TCTATAACGA TTG CACCAGC TTTTTCAAAA AG TGGGG CAA AGTGTTCGCT TGTACCGCCA 534 0 

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCAT CAGT TAATTCTTGA 54 00 

W 

ACTGTATATG TTTTT CCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 54 6 0 

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 552 0 

15 ACTAATCCTG TTGCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 55 BO 

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 5 64 0 

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 57 00 

20 TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 57 60 

GATGCCACAC CTGACATATC ACGCATGCCT GAG C CAATT A ATGAAATTTT GACATAATGC 5 820 

TCATTGATTT TATAAGCTAA TG CTT CAT AT TGATTCTTTA ATGTTTCAAG AATCATAGAA 58 8 0 

25 

ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 5 94 0 

ATTTGTGAAA TCATATCAAC ATTTACAGCA C CTT CTT CAA GTTCCGTAAA TAGTTGGGTA 6 0 00 

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 606 0 

30 

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6120 

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAGTTTTT AGCTAATTCA 6180 

35 ACACTTCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 624 0 

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 63 00 

TCAACGTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 63 60 

40 GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 64 20 

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 64 8 0 

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCG CTTG ATAGCCAGTC 6 54 0 

ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6 600 

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGGTGCTTT AGTCAAGGTT 66 60 

so GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 6720 

TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTTTTGTAAA 67 80 

ATCACTGACG GATGATCCGC CAAATTTCAA CACACTTCTT GTTAC CAT AT AATCCTCCTA 6 84 0 



1069 



EP 0 786 519 A2 

TAATCTATAT ACAAG TGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6 960 

AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 7 02 0 

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 707 5 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 





AGACGTACTT 


TGTGATTmCG 


AAGyrCGTAC 


TGmAGCACTT 


GTCGACGTTG 


ATGTACTTGT 




20 


TGAACCTGAT 


TGACTAGTAC 


TTTGTGATAA 


TGACTTACTA 


TCAGAATCAG 


ATGTACTTTG 






TGAATCACTT 


AATGATTCTG 


ATGTACTACC 


TGACTGAGAC 


GTGCTCATTG 


AACTACTTAC 




25 


GGACATTGAT 


TTACTGTCTG 


ATGCAGATAA 


TGACCCACTT 


GTACTGATAG 


AGTCACTTAC 




TATCTCTGAA 


GTACTCATCG 


AGTCTGATGT 


ACTTGTTGAG 




ATGCTGCTAT 






GCTTAGTGAT 


C CAGAAACAG 


AACCACTTGT 


GCTCGTCGAA 


TCGCTCAATG 






30 


ACTCATCGAT 


TTTGAATCAC 


TTGTACTTAA 


TGATATTGAT 


GTACTTTGTG 


AATCTGATTT 


420 




GCTTGTTGAC 


G CACTTTG AG 


AGTTGGCTAT 


G CTATTTGAA 


ATACTGATAG 


AGTCCGAGGT 


480 




GCTAGCTGAC 


TCGCTCAATG 


ATGTTGATGT 


ACTAATTGCA 


TTCGATGTAC 


TGTCACTTAA 


540 


35 


TGATGCTGAT 


GTACTAGACG 


ACCCTGATAT 


ACT CGTTG AT 


AAGCTTTGTG 


ACTTAGACAA 


600 




GCTTCCTGAT 


GTACTCATAC 


TTAATGAGTC 


ACTGAGTGAT 


GTTGATGTAC 


GCAATGAATC 


660 




AGATSTACTT 


GTTGATAGAC 


TTT CGGATTT 


TTCAGTACTG 


CTAGAGTTTG 


AAATAGAATC 


720 


40 


GCTTAATGAT 


GTTGATTTAC 


TAGCTGAATC 


CGACATGCTT 


GATGATACAC 


TTTGTGAATT 


780 




CACTAAACTT 


GTGCTTGTTG 


AGCTTGATAC 


ACTATTACTT 


TCAGATGTGC 


TTAATGACTT 


840 




AGATGCACTC 


ACAGAATCAG 


ATAGGCTTAC 


ACTTGTCGAT 


TTCGAGGTAC 


TAGCTGATGT 


900 


45 


AG AT AC CA CA 


ATCGATCCTG 


ATGTACTCGT 


TGATGCACTT 


TGTGAGTCAG 


CTTTACTTGT 


960 




TGACACACTT 


TGAGATTGTT 


GTGTACTTCC 


TGATGTTGAT 


ACGGAATCAC 


TCATGCTATT 


1020 


50 


TCTTGTTACT 


TCATATTTAA 


AAGTTGTCGT 


CGTTTTGTTA 


CCGCTCGCAT 


CTGTAGAAAC 


1080 




GATTGATATA 


GTACTTGTAC 


CAATGTTTGT 


TGGTGTACCA 


CTAATAGTAT 


TATTTGTACT 


1140 




ATCAAATGTT 


AGTCCGGATG 


GCAATCCAGT 


CACTGTATTC 


GTCACCGCAT 


TTC CACTGTT 


1200 
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ATTGGTGTCA CTGTTGGTGC TGTCGTATCC ACAACATTTA TTGTAAAAGT TGTCGTCGAT 13 20 

TTGTTATTTG CTTGGTCAGT AGACACAACT GTCACTGTTG ATTGACCAAT TTTTGTTGGT 13 80 

GTCCCAATGA TTGAATTCGT TGCACTATCG TAACTTAATC CGCTTGGTAA TCCTGTAACT 14 4 0 

GTATTTGTCA CAGTCCCAGT ACCATTATCC GTTGTAGTCA ATACAATAGG ATTCATTGTT 15 00 

TTACCCACTT CTATGGTTTG ATTGCCTACA GTTACAGTTG GTGCTTTTAC AT CAGTAAAA 15 60 

TAATATGTCA CTGATTGTCC AGCATTCGTC ATTTTTACAG TTTTATTTGT ATCATTATAA 1G20 

GTTGACGCAT ATGAACTATC GACGGACGTG TAGTTATATC CTTTAGCAGT CAATGCAGAT 16 BO 

TGCTGaTTAT CGATTGTCAC GACTTGATCA ACATTTCCTG AATATGTTTT TGGTGGAATA 174 0 

ATATCTTTAC CTGTTGTTAC ATCAACGTAT CTCACTTGTG TAnCAGCAGA CTCTGTATAT 18 00 

TCGAATGTTC CAAATTGTAC TTGTTGTAAA TTTGTCGCGC CACCTGTTGA GGCTGTCATT 1860 

GATAATGAAA AGTTGGTCGT ACCACTTTTC GCAATCCAAT CTGAAATATT ACGTGTCCAT 1920 

GTTTGACCTG CATATTTGAC AGTCATAACC TTTGTATCAC CATTATAGTT AATATCAAAA 198 0 

TCTTGGaACG TGTTATTTGT AGGTTGaACA TTTAACTT CG CAGCATTATC AG CTG TTGAA 2 04 0 

CTTGATGTAT ACGTTGTCGC AACACCATAA CTATCTGTTG TTACAAATGC AC CAAACGCA 2100 

CCTCCACCAG CTACATTAGA TGGGTCAGCA TTCGCCTTTG CAGCTGAATT TGGTTTAGAT 2160 

GTATTGTGAT ACGTATCCAA TTTGAAGCCA AATGCGTTAC TTAAGCCACC AATACCTACT 2220 

GCGGCACCGT TTAACCCTGT TTCACCTAAT ACACCTGGTG AAAAGGCAAA ACCGATACCA 22 80 

TCTCCACCAT TTCCATGCCC TTCATATTTG TTACCTAAAT TTACTTTTCC AGAAAAATGA 2 34 0 

AAACTCTTAT TAGAGTCAAT ACGTGTTCCT AATGTAATAG CACCTTTTTG GCTGTATGCA 24 00 

TCCTGTGTTA ACGTCACAAT ACCGGTACTT TGATCATAGG TAGCATTACC TGACGTTGTC 24 60 

ATATATTGTT TTAAGTTATC TTTATTAACT GTAATTGTAT TAGCAGTTAC TGCGGTTGTC 2 520 

GTCgCTGctG ACGCAAATGT TGACATAGCT AAGCGACTGA AAGTTCGAAG TTTTACTGGT 2580 

GCGGTG CTAG TTGACGTTGT GCTAGTTTTG TTTAAGTTGA CCGAAGATGG CGTTGTGCTT 2 64 0 

TGTGAAGTGT TATTTGATGC AGTACTTTGA TTTGTTGATG TATTAATTGG TTGTTCTGTA 27 00 

CTTGAAGTTG AAGCTACAGA TTTAGTATCA GAACTTGATG TAGTATTCTT TGAGGATGTT 27 60 

GATTCTGATG TAGATGTCAA TTTCTCTTGT TGATTGCTTG TACTATTAGT TGTCGAAGTG 2 820 

ACCTTTTCAG ACTTTTCACT TGAGACTGTG TCACTATTTG ATGTTTGTAC CGAACTACTA 2880 

TTTtTCGTTA CACTTGTGGA ATCGGCTGTT GAT GTTGATG CTTCGATTGT CG TTGAGTTT 2 940 

TGATTACCTA CTGTTTCACT TTGTGTGTTT AATTCAGAAG TTAATGGTGC AT CAGAAGCC 3 0 00 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA 
CTATGACTAA TAAATGGTAG CCCCATAATT 

5 

TTTACCCAAT TTTTTCCAGA TTTATAAAGT 
TCATGAAATG CTTTCTGTCT TTTACT CATG 

w TTAGATTATA ATATATGCCA CTATTCAATT 

TCATTTTGTT AAAATTTGTA AAAATTCaTT 
TTTTTAAAAT TACTTTTTCT CGTTTATGCG 

15 TTGGCTTTAT GTTTAATCAT TATAATTGTT 
ATCTATACCA TCCACCTTTT ATTTATAAAT 
TATG CAAAAT ACATCTTTAA TATTAAAGTA 

20 AATATATGTG TAATT CTAAA AGATTCTTCT 

TTAACATTTT CATTTATAAA GCGAAAAAAT 
GATGCATTGA ATTCACTAAA ATGATTAAAT 

25 

GAAATTTCTT CCTT CTAAAC CTGCTAACTC 
AAATATCTCA TTCACTACTG TGTAATCGTA 

3o CTTAATGTCC AATTT AC CAT TATCAAGAAT 

TCTTCCTATA ACAATATCTA CGGTAGATAC 
AACGTACTCA CATT CAAAAT GAACTGGCGA 

35 ACATTTTTCC TTTGTTACAC CTGCAAAATT 

CGATGATAAA TTAACTGCTT CTCTTAAATC 
TGTCTCTTCA GCATTTTTCA CTGTATCTTT 

40 TGCTGCGAAC ATAACCATAG GCGGATCCCA 
TAAATTATCT TTCCCATCTT TCGATACAGT 
ACTACTTTTA AATGGGTCGT GCGGTAAACC 

45 

ATTCATACTA TCTTCACCCC TTATAAGTAA 
AGCGTCTAAT ATAAATAAAC AATTT ATTTA 
50 TATAATTTGC GTTCTTGATT CGAAAAATTC 

ATATCTTTTG TAAATTTAAT TAATTATAGT 
AAGTGAAATT GAAAAAGACA GCTATTATGC 

55 



CTAATGCTTT GATTATCTTG ACT CACTAAA 3120 

TTGAACATTT CTATTTCTTT AATTCCGGAT 3180 

CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 3 24 0 

TAATAACTCC TTGTATTATC TTTACATTCA 3 3 00 

TAATACAACT CTTTTTTGAT ACAAAAATAC 3 36 0 

TTTATTCGTC TAAATGTAAT CGTTTTCATA 3420 

TATAATCTTT TTTTATATAA ATTTGGCTAA 34 80 

TCGTTTTTAA AATAATTATT GTATTAATAT 3 54 0 

AGTTAATTTA CAACTAAACG ATAAATATTA 3 600 

ATACCAATAT TTTTTCAATA AACCTAGTGT 3 6 60 

TTAAAAATAT AAATACCACG A CAT ATT GCT 3 720 

GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 37 80 

TACTTATATC TTTTCATCTG ATTGATTATC 3 840 

TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3 900 

ATATCCTAAT CTGGCAATAG GTTTAATCGA 3 960 

AACCTTATCG TCAATATGAA CTTGGGCAAC 4 020 

TGGATCTCCA GTTGGAATAC GAATCGTTTG 4 080 

TTCTTTTACA CGATATCCTG GAGCTTCTAT 414 0 

AAATTCATCC TCTTCTGGTG GCAATGCTTT 4 200 

ATACGTTGCC ATATTCCACA CAAACCAACC 4 26 0 

ACGTTCGTGA TCAC CAAGAA CGGATTGATT 4 320 

AGTTAAGTTT TGATACTGAC TATAAGGCGC 4 3 80 

AGAGATCCAC CCTATTGGAC GTGGTACTGT 444 0 

ATGACTTCTT ACAC CTTGTT TTGGCGAATA 4 500 

TTACATTTAA GGTTACGCCC TCTTTACATA 4 560 

TAAGTAGAAA CTATATATGA CGTGGTTGCT 4 620 

AGATAAGGAT TTATACAATT AATATTTATG 4 6 80 

TACTTCAATC ATGATTAGTT TATAATAATA 4 74 0 

GATGAGCGAA AAACTTCAAG TAAAACAAGA 4 8 00 
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4 92 0 

GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4980 

TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATG CTT GGAACCTTCA 504 0 

AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 510 0 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 516 0 

10 

GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265: 

, 5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35B9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 60 

25 GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 12 0 

ATTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 180 

GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 24 0 

30 

TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 3 00 

TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 360 

3S GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 42 0 

TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTTACAAT 4 80 

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 54 0 

40 GGATTCCTTT CTTTACTCGG ATTCAAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 600 

GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 660 

GGGCAGCATA TAATG CAAGC TGATAAATCG CATTTGCATC ATAAACTATT AG CTTTAGGC 72 0 

45 

TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 7 80 

AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 84 0 

TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 900 

so 

TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 960 

TATGGGATAA TTTGTATTAT ATGGCTTTAC TCTTTACAAT TTTTTTGTAT TAAATTTCAA 102 0 

£5 
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ATTTACCGTC TTATGATAGT GCTTTTTATT TTTATTCAGT TGGTATATCG AAAGGTAACT 114 0 

GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 12 00 

TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 1260 

TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 1320 

CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 13 80 

AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA 144 0 

TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCGrGC TTTTTTaATA 1500 

TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGGTTCACC ATCATCATTC GCnTTTTGAA 1560 

TATTCATTtC AGGTCCAATA GTATATGCAG AACAATTATG AGTGGCATCT TTATGTTCTT 1620 

TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATGAG 1680 

CAATGAATCT TGATTTACTA ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 174 0 

TATTTTGTGT CATAATAACT C CTTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800 

CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT I860 

GTATCATTGC TATGAGTATA TTTTAGGAGG ACGACTATGA AAATTGCTGT GATGACCGAT 1920 

TCTACAAGTT ATCTGTCGCA GGACTTAATC GATAAATATA ATATT c AAAT AGCGCCATTA 1980 

AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 204 0 

GCCATTCAAA CATTAGAAAA GAAAGTATTA GATATTGTAA AAGACTTTGA AGAAGTAACT 2100 

TTATTTGTCA TAAATGGAGA T CATTT CGAA GATGGT CAAG CGTTATA CAA AAAGTTACAA 2160 

GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2220 

CATTT AGGTT CTGGTGGATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 228 0 

TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 2 34 0 

CTTTAAATGT TCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 24 00 

TTAGGCTCTT TTTTACGTTG AAATGAGGTT TTAAGCATTA AACATTACGG GAAATTAATT 24 60 

CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520 

CAAAATTCAA ACTATTATTG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580 

TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAATTGTAT ATCACTGGGT 2640 

CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 2700 

TATTATCATC TCTCATTTGA ATTGTCGGAA CAGCAGTCTT ATGCCTCAGA ACATATTGTT 2760 

CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2820 
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TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA TTAAAGACGC ATTTCTTAAT 294 0 

GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 3 000 

TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG ATACTATTTT TATTGATGAA 3 060 

GTCGATGCCT TTCCTTTATC AATGGATAAA AATTTACAAC AAGCATTGAA GTCATCTTCT 312 0 

AAAGTTGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 318 0 

ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 324 0 

CCTAAATATC GTTATTTCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 33 00 

TTACAAGAT C AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAGAAACA 3360 

ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 34 20 

GAGGATGTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGTCATT 34 80 

TTTACTACGA CAATATTAGA ACGTGGATTT ACAATGGCAA ATTTGGATGT TGTTGTTATC 354 0 

GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG CTGGACGTG 3 589 



(2) INFORMATION FOR SEQ ID NO: 266: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 60 

35 

TATCGACAAA GCAGTTAAAG TTGCTGTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 120 
AAATAAAAAT GAAATTGCGC AAG TAGGTGC GATTTCAGCA GCAGATGAAG AAATTGGACG 180 

40 TTATATTTCT GAAGCTATGG AAAAAGTAGG TAACGATGGT GTCATTACAA TTGAAGAATC 24 0 

AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAATTTGATC GTGGTTATCA 300 
ATCACCGTAT ATGGTTACTG ATTCAGATAA AATGGTTGCT GAATTAGAAC GCCCATACAT 360 

4S TTTAGTAACA GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 420 

GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGCG ATGCATTAAC 4 80 

AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 540 

50 TTTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 600 
GATTACTGAT GATTTAGGCT TAGATTTAAA AGATGCATCA ATTGATATGT TAGGTACTGC 660 

55 
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 78 0 

TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 84 0 

5 AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACGTAAA TTA CGTATTG AAGATG CATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 960 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTGAAA CAGGTGT 1017 

w 

(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1409 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

TTAATC CAG C GTTAACTGTA TTTG CATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG SO 

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 12 0 

TTTCCTTAGG CGTAGTTGTT CCAACTTTAA AAGAAATGAA TATTATGAGA ACAACTATAG 180 

GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 24 0 

TCTATGGCGC AATCAATGGT CAAGG CGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 300 

TTTTCACAGC AATTT CATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 3 60 

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 420 

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTG CATT C TTAG CAGGTG 4 80 

TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 54 0 

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 6 00 

TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 66 0 

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA A CGATTG CAT 72 0 

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780 

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 84 0 

GTGTATTCGT TC CGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 900 

AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 960 

CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 102 0 

TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 1080 
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AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTG CAGA CTTGAAAGCA 1200 

CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 1320 

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 13 80 

GTAATTTCCC ATTCGGAGGA GACATCATC 14 09 



(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4702 base pairs 
CB) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(D> TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

AAAGAGGGTT TTTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCTTTGGC 6 0 

CCGTTCATAA CCATAGGAAA AT ATATG CnG nAATCATTGA TAGTAAATAC ATCAATAATA 12 0 

CGAATAATGT CGGTGAAATC ACGTCCGTAT TACCATTATT AGCTAAAACA TTTTCCAAGT 180 

TTTCTTTTGA ACCGGATACT CATAAATGCT TTTAATGCnT GGTTTTGTCT GTCGCCATCT 24 0 

TTAGAATTTG TAATACTTTT TTAAATTCTT TAGCGAACAA CTCATTATCT TTATCGTTTT 3 00 

TAGCCATTTG ACGATTCAAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 360 

GCTTATTCAC ATTATTGTCG ACTTGATTTA ATTGGCCACT AACAGAATCT GCAATTGATT 420 

TTGATT CTTG TGTATCTGAT AGCAATTGCG TACTCTTTTC AGAAATTCTA CTAATTTCTT 4 80 

TATCTAAATT TGAAGACATC GTATTAAATT CATCATTTTT GCCTTTATCA ATTTTTGGTT 54 0 

CTTGTGGCTC TTCAGCAAAA GTCTTTTTAA CGTTTTCTAA CTGATCAATC AGCTTGGAAA 600 

TATCTTCTTk ATTTTTTGTT gTATTCTTTT TGTTATTTAA AATGTCATCA ATCAGTTTGT 660 

CTGAGTTTTC TTCCATTGAA TCAATTTGaT GTAACACAGC TACTTTATCG TCTTTGAAAC 72 0 

TTTCCATGTC ATTGATAACT TGGTCAACCA TCATATCAAT TAAACGTTTG TTGTCGAATG 780 

GTTTATCTTC tCTGCCTTTT GTATCTGTGT ACATTTTATA ATGCGCATCA AACCTAGATA 84 0 

ATGCACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG AC CATCaAGG TTATGATTCA 90 0 

ATGTTACATC CACAACACTC GTTGCTTTCT TATCATTTGG TT CAT CTTGA CGATTTGCTT 960 

GTC CAAATAA CAATTGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 1020 

TTTTCAATTT AGCAACGCCA TTGACTTCAA CTTTATATTC CTTGTTAGAT GTATCGAGTT 1080 
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TAAAATGAGG AtCTGTTGCA ACAGTTAATT GATTAATATC ATTACTTTTA ATCGTTTCAG 12 0 0 

TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTGT GTCTTGCGCA ACTTGGTTAT 12 6 0 

CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGG TTTTTT CAAATTATAT TCATTTTTAT 132 0 

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 13 80 

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 14 4 0 

TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 1500 

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 156 0 

CTGTTAATTG CTTAC GATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 162 0 

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCTTTAC 16 80 

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 174 0 

TATAGTTATC AAGTTCCACG CTATCTTTTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 18 00 

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT 186 0 

AATCTGTGTT CACTCTGAAC GTAT CTGAAT TCGCACTCAA TAATGATTTA TTGTATGTTT 1920 

GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980 

ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACAC CAT 2 04 0 

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100 

TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTA CATTA CTTACAACTT 2160 

TTTCTGTGTT TTTAGCTACT TCTTCTTTTT GTCCTACAGC TGTTTTATAC TGTAGCGATA 2220 

TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 22 80 

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTGTTACTG 234 0 

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 24 0 0 

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 24 6 0 

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2 52 0 

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 2 580 

TCCAATTTTT CTTTTTCATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2 64 0 

AAAG CCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 27 00 

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA A T T GT TGTTT 276 0 

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 282 0 

TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2 88 0 
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TCTGATTTCC TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 30 00 

AAGTTTATCa AAACTTTTTA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3 060 

5 TATATAAATT ATTAGTATAT GTATATAGTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 3120 

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTG TAAT ATAAATGTGT 3180 

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 324 0 

10 

ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 33 00 

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 33 60 

s GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 34 2 0 

TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 34 80 

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 3 54 0 

20 TTCACACACA CCGCCTCAAA CGTCACCAAG CAATC CTTTG ACACCAGCAA TAC CGAATGT 360 0 

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3 66 0 

AGGCATCGAT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTTA ATGA CAAAGC 3 720 

25 AAAGACCGCT GACGGTAAAC CTTTAGCATT AGGTAACGGT AAAATCATTG ATCAGCCTCT 3780 

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 384 0 

ACGTGCCAAA GATGGACACA CGATTAGTAC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3 90 0 

30 CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTTACA 3 96 0 

AACAGTAAAT GGT CCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA TTGATGGAAG 4 02 0 

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAAC CATCTC 40 8 0 

35 

TGCTTCAGAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 414 0 

GCGACCAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 42 00 

40 TTATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 426 0 

TACGAAAGTA TAATTGTAGC T ATAATAAT C CAAGTCGTAA CAACTAGTGG CACTATCGTC 43 20 

TTGAATAAGA ATATAC CGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 43 8 0 

45 ATGATTATAA CACCAACAAA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 44 4 0 

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAGTAG 45 0 0 

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4 560 

50 TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 462 0 

ATTGCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 4680 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pail 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

ACAAAAAATT CACCCTCATT AATATTGAAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 60 

TAACGCTTAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 12 0 

AAAACCGAGA TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 18 0 

TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATGAA TAAwAGGwAA TCwCCAGTTA 24 0 

AGAAATAGTG TTATTTTACC TTGAATTCAA AAAAACACCC AGTAAAACAA GGAATGCTTA 3 00 

CTAGGTGTCT TCACTATACT TTGG CTTTAT AATTTTGAAT CGTTtCTAAA AATGCTGGAC 360 

AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 420 

CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAATCAGA AAATATACTT ACAGGTGCTA 4 80 

TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 54 0 

CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 600 

TGAATACTTC TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 66 0 

CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 72 0 

ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 78 0 

TATAATCTGk TGACTCTCCT CTTAACACTT GAATGATAAC ACTATAACTC TCTTGTTGTT 84 0 

TCATACGAGC GATGCAACTA ATAATCATCT TAGCTTCTTG TGTCATATTA TATGATTTAT 900 

CTTGTTGAAC ACAATTACTA CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 96 0 

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATTTTAGTTA 102 0 

ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 1080 

CGTGTAAATT GATAtCGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCCGTCAC 114 0 

GACCTGCACG ACCCGCTTCT TGATAATAAG ATT CTAAATC TCCAGGCATA TTATAATGAA 120 0 

TAACAAAGCG TACATTGGAT TTATCAATAC CCATACCAAA AGCATTTGTA G CAACGACT A 126 0 

CTTTAACACG ATCAAATAAG AAAT CATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 13 2 0 

CTGCATGATA TATAACACTT TCAATTTTCT GACTTT CTAA GGCTTCTTGA AGCTCTTCAA 1380 
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CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 1500 

TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 156 0 

GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 162 0 

ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 1680 

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 174 0 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 18 00 

CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 18 60 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TG AT ATTAG T GGACTTATTA 1920 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCraTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC A CAT 2 0 04 
(2) INFORMATION FOR SEQ ID NO: 2 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTCAAT TGCCACATTT TT AT CAAG AG 6 0 

TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 120 

GATTTTCTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT C CAAGTT AT A 18 0 

CGTATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 24 0 

CGTAATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 300 

TGTATTATTA CTTTG CTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 360 

GATTACGCAC ATTTAAATTT AGAACATTTT TTTGCACGAA ACGACGATTT AGATGTTATA 42 0 

AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 480 

GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 54 0 

GCAAGTTTTA TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 600 

TGTTTTTTAA TACGTGAATA ATAAGATTAT TGATATAAAG GTTTTCAAAG GTTATACAAA 660 

AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 720 

CGTTTCATTT ACATTACTGT TTATTTATGA ATATGTAACA ATGCATAGAT AAAATTGTTA 780 
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ACCTAAGAGG TGTGGATATG AATAAACACA AGAAAGGTTC TATTTTT GGA ATAATAGGAC 900 

TTGTTGTCAT ATTTGCTGTT GTCyCaTTTT TATTTTT CTC AATGATATCC GATCAGATAT 96 0 

5 TTTTCAAACA TGTTAAATCC GACATTAAGA TTGAAAAGTT AAATGTTACA TTAAACGATG 1020 

CAGCAAAGAA ACAAATAAAT AATTATACGA GTCAACAGGT ATCAAATAAA AAGAATGATG 1080 

CATGGAGAGA TGCATCTGCA ACTGAAATTA AAAGTGCAAT GGATAGCGGT ACTTTTATCG 114 0 

w 

ATAATGAAAA GCAAAAATAT CAATTTTTAG ATTTATCAAA GTATCAAGGG ATTGATAAAA 12 00 

ATAGAATTAA ACGTATGTTA GTAGATAGAC CAACGTTATT GAAACATACG GATGATTTCT 1260 

TAAAAGCTGC TAAAGATAAG CACGTTAACG AAGTTTATTT AATTTCACAT GCATTATTAG 132 0 

75 

AAACTGGCGC AGTTAAAAGT GAATTAGCTA ATGGAGTCGA AATTGATGGC AAAAAGTACT 13 80 

ACAATTTCTA TGGAGTAGGA GCCCTTGATA AAGACCCAAT TAAAACAGGT GCAGAATATG 14 4 0 

20 CTAAAAAGCA TGGTTGGGAT ACACCTGAAA AAGCTATTTC AGGCGGTGCT GATTTCATTC 15 00 

ATAAGCACTT CTTATCAAGC ACAGATCAAA ATACATTGTA TAGTATGAGA TGGAATCCAA 1560 

AAAAT CCAGG AGAACATCAA TATGCTACAG ATATTAAGTG GGCAGAAAGT AATGCAACAA 1620 

25 TTATCGCTGA CTTTTATAAG AACATGAAGA CTGAAGGAAA ATACTTCAAA TACTTTGTGT 1680 

ATAAAGATGA CAGTAAACAT TTGAATAAGT AATTTGATAA GCTACGAGTT GTTTTTATGA 174 0 

CTCGGACATA CTAAAAAGAC GCTTTCTATC TTGTTTTGAT AGAAAGCGTC TTTTTGCATT 18 00 

30 AGAGAAAACA CATTGATkGA TAAtCCCaCC aATGCAAgTG GGGcAGGACa TCGATAAAGA 1860 

ATTACTTTTT CTTTAGAAAT TAGTATTTCT TATGCATGAG TTTTACT CAT GTATTCCTAT 1920 

TTTTAAGTAC ACATTAGTTA TAGCTAATGA TAAAGAACCA CTACATAATA AATCATTAGT 1980 

35 

GTTTTTTTAT CATTTCTGTC CCaCTCTCAT CGTGATTTGA AATTTTCAAT TGCGATTTTA 2 04 0 

ATTTCATCTC TTACACGTTG GAACTCTGAC CAAGGCTTGC CTGCAGGATC ATCAAATCCC 2100 

40 CAATGTTCTT TCTTAACATT TGTTGGTAAA GAAGGGCAAT TTACGTCTGC AT CACTACAT 2160 

AATGTAACAA CTAAATTTGA ATTTnTAATA ATATTATTAT CGGATTAAAA TCTGATGGAT 2220 

GATTTGATAT ATCAATGCCT ACTT 2244 

45 (2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



55 
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ATAAG CAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60 

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 12 0 

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180 

TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 24 0 

TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 300 

AAGAAGGTAA ATTG CAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 360 

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 42 0 

CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 4 80 

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AAT CAGGTGT 54 0 

TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 600 

CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 660 

AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 72 0 

AACTAC CTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 78 0 

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TATGACACAG 84 0 

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 90 0 

AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTTTG 96 0 

TTT CTTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATGAAATCA 102 0 

AAAGG CAATT AACTTATTAT GATGAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 108 0 

TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 114 0 

GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 1200 

TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 1260 

AT ATT AAT CA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 13 2 0 

TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 13 71 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6035 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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CGTAAAcCTA TGCGTTTTAA TATTCTGAAG TTACTTAGTT CATCCTCAGT TTCATCCATT 12 0 

TGTTTAATAT AAATAATACA TCCAGCTGCT ACTAAAAATG CTAATCCTAA AAATGATGTA 180 

£ ACAAATATTA GAATACCGTT AGTAGCATcG ACCTCTTTTT TCATTTCATC ATACGTGATG 24 0 

ACTTTGTCTC CAAACTGTTT TGCAATTGCT TGAGCTTTTT CCTTTTGTGA TGTTTGTTTA 3 00 

ATATCATATC CATAAAAAGT ATGAACGTTA TTTTGTGTTT TCAACTGCTG ATACTTTTCA 36 0 

'° GGACTTACTT CGATGACAGG TGAGTTGAAG CTTAGATTTA AAGGATAAAC CTTACCTTTG 42 0 

TCTTCTTGTG TAcAC GGAAA GTTTCATTCT TAGTACCTTT TACTACTAAA TCTTTGTTTA 4 80 

AATGGATATT AATAATGTTA GGCAGCGATT TTGTATTTGT AATGATGGCA TTGTTGCCTG 54 0 

15 

TtAACTTGTT ATTTGCACTT AAAATAGAAT TCGTGCGACC TGAATCACTA CCATTTTCCA 600 

AAGTAATAAC CTGATCATTA ACATTATCTA CAGTAATAGT TTCGTAAGCA TTTTTAGAAA 660 

20 ATGTAATTTG TTGTTGGCTT AGTTTAGTTT CAAATTGTTT AGCATCTTGA GTAGCGACCA 720 

CGTTAAATTC ATTTGGTGCC ATAGATGTAA GGGTTTGATC TGTATTTGAT TTAGATAATG 780 

CCGCAAAACA CAATACAGTT ACTGTAACTG CAGAAATAAT TGCAATGATA GTTAAAGACA 84 0 

25 TGGCATTTTT CTTCATTCTG TACATAATAG ACGATGTGAA TACAACATCG GTAATAGATA 90 0 

CGCGTCCATT TTTTGATTTT TTCAATGTTT TAAAAATAAG TGACACGGAA CTTCTGAAGA 96 0 

ATAAATAGGC GCCTACAACC GTTAAAAATA AAATGATAAA CGGTGATGTC ATAGCCATAG 102 0 

30 TTAGTGCTTT GAACGTACCA AACATTTCTG TCGCCATATA ATAGCCTAGT GCAATCATAG 1080 

CAATACCTAA TACGCCTGAA ATAACCTCTG CAGTCGTTAC TTTAGCAGTT GTGGCATCAG 114 0 

TTTTAATTGA ATCTTTCATC ATTGATAAGA TACTACGTCT TTTTAGAAAT AAAG CACTTT 1200 

35 

GAAATAAAAT CAGTACATAC GCAATAATTA GCATGAAAAT AGTTAAAACA AGGGCCATAG 12 6 0 

GTTCGAAATG TATCGATAAG TTAATCGATA ACGACATCAA TTTAGATACT ATGGAAAGCA 13 2 0 

4Q ATAa TTGTGC ACCTGCAATG CCACATAATA CACCGACAAC ACCTGTGATT AAAAATACGA 138 0 

TCATTtGTTC AAGTGCTAAC ATTTnCAAAA TGTTTTGTCG TGTTAAACCA ATCaACTGAA 144 0 

ATAGCG CAAA TTCACGTGTA CGGCGTTTTA CGrmTAAATG ATT GG CAT AC ATTAAAAAGA 1500 

45 TGACAATAAT GATAAATAAA AATATTGATC CGACTAAAGC ACCTTTCTTA ATGATGGC CA 156 0 

TCGAGTCGTC ATTATTTACA CCTTTAGTAA ACTGTAAGGT TGTAAAACTG AAATATAAGA 162 0 

CGATGCTAAA AAATAATGAA AATAAATACA TTGCATAATG TTTTAAGTTT TGTCGTAAGT 1680 

50 TTTTGAAAAC GATATGGTTA AATGTCATTT GAGACACCAC CTAATACTGA TTGAAGATGT 174 0 

ACAATGTCTT CATAAAAGGC CTGTTTAGAA CGTCCTTCCT GATAAAGTTG TGTATGAATT 180 0 
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ACCATGACAA TAGTTGTATC AAACGATTTA TTCATTTCTT CCAAACGTTG TAATAGGTCA 192 0 

TTTGCACTTT TCGAGTCGAG TGCGCCTGTT GGCTCATCTG CAAATATGAT TTGTGGTTTG 198 0 

TGAACAAATG CTCTCGCTGC TGCAGTTCTT TGTTGTTGAC CAC CAGATAA TTCGCTAGGG 2 04 0 

TATTTATTTC CTAGGTCATA AATAC CTAAT GCTGTCGTGA TCGCTTTATA ATTTTCTTCC 210 0 

ATTGTTGCCT TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 2160 

AATGTCGGCA GAATACTGTA ATCTTGGAAG ATGAAACCTA ATGATTCTTT GCGGAATTTG 2220 

GCAAGTGCTT TTTGATTAAG TTTATTAAGC TCTTGTCCGT TAGCAATCcg CTAC CGCTAG 22 8 0 

AAATTTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 2 34 0 

CCATAATCGC AACGAATTCG CCTTTTTGTA TGTCAAAGTT AATATCTTTA AGTGCTTGAA 24 00 

ATGTGTGCTT TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 2460 

ACTCCTT t TG TATTTAATTT CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTAT CTAAAC 2 520 

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 2580 

AGTGAGGATA CAGATTCATC GTACATATTA GACAAAAGCA ATGGTG CTTT CTAAGTGATG 264 0 

ATGTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 2 700 

TTAAAAGAGG GTTTTATTTG AAAGGAATTG ATTGTTATGG AAAAAGGAAA TCAAGGTATT 2760 

AAATGGTCTA GTTTAATAAT GGGTGTATTA TTATTAATGT TGGCAGTCGT TATTTTTACA 2 82 0 

TTTCCAATTG AAAATTTTTA TGCTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 2880 

GGTGTGATTC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 2 94 0 

TGGATTCTGT TTATGGGGAT TGTAGATATT CTATTTGGTC TATTAGTTAT TTTTAATGTT 30 00 

GGCGCAAGTT CAGCATTCTT TATTTATATG TTTGCTTTTT GGTTTATTTT TAGTTCTATC 3 06 0 

TCTGGATTAT TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GATTTTTAAT 312 0 

TTATTAGGTA TTGTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 3180 

ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 324 0 

CTTGCTTAAG TAAAATGAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 3 300 

CCTTTTTGTT GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 3 3 60 

CCCATATTTT TATATTTTAC TATTATTGTT TTTCAATACG ATTAATAGTT ACATTTAGTC 34 20 

CAAAATATTT TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 34 80 

TACCATTCCT TGTTAAAGTT AAATGATTTT CAGACATTGT AGCACGG CCA AATGATTGTG 3 54 0 

GCATTGTAAT TAATAAATGC TGTACAAATA TTGAAT CTGG ATG CGTTTG A TTATATTCGA 3600 
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AATGATCATT TTCGAATTTT TGAACATAGA AAATATCCTT GTCTTCGTTG TTAAAAATAG 3720 

CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 3 780 

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 3 84 0 

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTGTGAATAG 3 90 0 

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATGTAT 3 960 

TTAATTCATA ACAAAAACCA C CACGTTTAT CATGAACAAT TTTATTAAAA AGTGCATCGA 4 020 

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 4 080 

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATGAAT 4140 

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4 200 

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4 260 

TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 4 32 0 

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4 3 80 

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC ATTTGTGTTC 444 0 

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGC CAA GTAATTAAAA ATAATG CAAC 4 500 

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4 56 0 

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATGCTGC 4 620 

CATGACTGAT ATCGCAGTAA T CGTTATATT GTAATAGATT TTGCGAATAG GATTGAAGAA 4 680 

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4 740 

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGG CTT GTTGTGATGC 4 800 

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4 860 

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4 92 0 

TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4 980 

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 5040 

GATAAGCACT AAAAAGAAAC CTGAAACTAG TGTAC CAATC GTTCCACCAA TATCTTGGAA 5100 

ATG CGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 516 0 

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATGGAT CTTTGCGTTG 522 0 

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATGCATG 5280 

ACG CAAAC CT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTTTCC 534 0 

AG CAATC CAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATGTATGG 5400 
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ATATTTAATC ATACTGTATG TT CAATGGGC ACTCTAGTAA TAAGTGTTCA TATAACAAAA 5520 

ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 5 58 0 

GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG ATGTCACTTT GCTTAACTTG 564 0 

GCATCCTTTT TATGATTTT C AAATTCAAAA AAATGAGCAA AATGAATCTC TTTAC CAGTT 5700 

TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTGTGATGCT GGAATGGATA 5760 

TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 5820 

TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG TTGTAACCAG TGAACGTAAT 588 0 

AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA TCATTTGTTG ACGTTTAGGC 5 940 

GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA ATGATGACTG AACTTCCGTT 60 00 

TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 603 5 



20 (2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 273: 

30 TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 60 

TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 12 0 

ATTTCTAGGG GCTATCTCTT GGTT CTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 180 

35 

ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 240 

ATT AG C CATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 3 00 

AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 3 60 

40 

GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 42 0 

AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 4 80 

45 TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 54 0 

ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 600 

ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 66 0 

50 AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 72 0 

AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 780 

55 
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 900 

TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATTTC 960 

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 102 0 

AAAGAATTAA TTCATATnT 103 9 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 



20 


GAGAGAATnT 


G CAATTAGTT 


ATT CAATTAG 


TTGATTTAAG 


ACATGATCCA 


ACACAAGATG 






ATATCTTAAT 


GTACAATTAT 


TTGAAACATT 




TACTTTAGTT 


ATATGCACTA 






ArGAaGACAA 


AATTCCaAAA 


GGTAAGGTyC 


AAAAGCATAT 


TAAAAATATT 


AAGACACAAT 






TAGATATGGA 


CCCAGACGAT 


ACAATTGTAA 


GTT ATT CATC 


AATTCAAAAT 


AATAmaCAAC 


240 




AACAAATATG 


GAATTTAATT 


GAACCGTATA 


TTTCATAGTT 


TTTGTACGTC 


AAAACTTATA 






CAAAAATTTT 


AAAAATAATG 


TAAGCACGAA 


ACTTTTAATT 


AGTACACAAT 


TGATAACATT 


360 


30 


TTTCAACGTT 


CATCATTTTG 


TCAAAAACTC 


AAAAGTAAAT 


TAGAAAGATT 


ATAATTTATT 


420 




TAAGCATCGT 


ACTTAATTGG 


ATTTTAAATT 


ATGTTATAAT 


ATTTGTATTG 


TTAGTATATA 


480 




TGGGGGCTTT 


TCAAATGCAT 


TTTATTGCAA 


TTAGTATAAA 


TCATCGCACA GCTGATGTgC 


540 


35 


ACTAAGAGAG 


CAAGTTACTT 


TTAGAGATGA 


TGCCTTACGA 


ATTGCCCATG 


AAGATTTATA 


600 




TGAAACTAAA 


TCTAyTTTAG 


AAAATGgTCA 


TATTaTCAAC 


ATGTAATCGA 


ACTGAAGTAT 


660 


40 


ATGCTGTTGT 


TGATCAAATT 


CACACAGGTC 


GTTACTATAT 


TCAACGATTT 


CTAGCT CGTG 


720 




CATTTGGATT 


TGAAGTAGAT 


GATATTAAAG 


CAATGTCAGA 


AGTAAAAGTG 


GGGGACGAAG 


780 




CaGTAGAACA 


TTTATTGCGT 


GTCACTTCTG 


GTTTAGATTC 


AATCGTACTT 


GGAGAAACTC 


84 0 


45 


AAATTTTAGG 


TCAAATAAGA 


GATGCATTTT 


TCTTAGCGCA 


AAGCACAGGT 


ACGACAGGrA 


900 




CAATTTTTAA 


TCATCTATTT 


AAACAGGCAA 


TTACTTTTGC 


AAAAAGAGCA 


CATAATGAAA 


960 




CAGATATAGC 


TGATAATGCT 


GTAAGTGTGT 


CTTATGCTGC 


GGTCGAGTTG 


GCGAAAAAAG 


1020 


50 


TATTTGGCAA 


ATTGAAAAGT 


AAGCAAGCTA 


TCATTATTGG 


TGCAGGGGAA 


ATGAGTGAAT 


1080 




TATCACTATT 


AAATCTTCTT 


GGTTCTGGAA 


TTACTGaTAT 


TACAGTAGTA 


AATAGAACAA 


1140 
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TACCAAATTT AC TTGAAAGT GCAGATATTG TGATTAGTTC 


AACGAGTGCA 


CAATCTTATA 


1260 




TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG 


AAAG CAAG AT 


TCACTAGTAT 


1320 


5 


TGATTGATAT TGCAGTTCCT CGAGATATTG AAC CTGGTAT 


TAGTGCCATC 


ACAAACATCT 


1380 




TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC 


AAACTTACGT 


GAGCGACAAT 


1440 




TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT 


ACATGCACAC 


AATGAG 


1496 




(2) INFORMATION FOR SEQ ID NO: 275: 








15 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4826 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 










(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 








CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC 


AGCTTCTAAA 


TCCTGCTTTG 


60 




GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA" TGATACAAGT 


GCAGCAAGGA 


120 


25 


AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC 


ATTTAAACAG 


TAAACTATAA 


1B0 




ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT 


TGCTTGATAA 


GTTGCAACAC 


240 




CAGTCATTCC ACCTAAAATA A CAG CGAT AA ATAATAAAGG 


ACGCATTAAT 


ACATATGGGA 


300 


30 


AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC 


AGCACCATAT 


GACGTTGCTT 


360 




TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT 


ACCTAAACCT 


GGTCCAGGGT 


420 




TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC 


AGCTTGATCT 


GCACCAAGCG 


480 


35 


GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC 


AATTTTTGCA 


GGCTCTACTA 


540 




AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC 


TTCAACTGCC 


ACTGATAAAA 


600 


40 


TATGCATAAT AAATTTCATA AGTGGTGCTA AAATTTTAAA 


TCCTGCAATC 


GTCATGATAA 


660 


ATCCTAAAAT AC CAG CAG AA AAGTTATTAA ATAACATTTC 


AAAACCTTGC 


GGCGTTCTAG 


720 




GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC 


AAGTGGACCC 


ATAATCATTG 


780 


45 


CAC CAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC 


CCCCATAGTT 








CGATGATACC AC CACGTAAA TCATAAATTA AACGACCACC 


ACTAAATGCG 


ATCAATAATG 


900 




GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA 


ATCTTTGTTA 


GGTAACCATC 


960 


50 


CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT 


GAAAGCGCCA 


ATGTTTGGCA 


1020 




TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG 


ACGACCAATT 


CCTTTTTTCT 


1080 



55 
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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 12 00 

AATTAAATAA AGTGTCATCA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA 126 0 

5 ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320 

gTACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 1380 

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GTGGCCTTTA TAGAGTGCAA 144 0 

to 

TATATGTATT TGTAGACGAG GAGGATAGTG ATCGAATAGA TCGGCGGATG CTATCCCGGA 1500 

T3TGGC TCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA 1560 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 1620 

15 

TGTATAGTTG TAAAAATAAC TATG CTTAAT TTGTTATGGA TGAATG CG AT GATAGCATGT 168 0 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCGGCAATAA ATCATAATTT 174 0 

20 ACGCGTACTA TTC CAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATATTG 180 0 

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGG TTTAGA AAAATTAGAA TACAGAGGTT 1860 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 192 0 

25 GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 1980 

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAAACTCT CATCCACATC 2 040 

AATCATCAAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 2100 

30 TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCG 2160 

TTCAATTAGT TGAATACTTT TCAAATCAAG GACTTTCAAC TGAAGAAGCA TTTACAAAAG 22 20 

TTGTGTCATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 2 280 

35 

CAATCTATGT TG CTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 2340 

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG 24 00 

ACCATGAAAT CGTTATTGTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 24 60 

40 

TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 2 520 

ATGCACACTA CATGTTAAAA GAAATTCATG AACAACCAGC AGTAATGCGT CGTATTATTC 2 580 

4S AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATC AATGATGTTA 2640 

AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 2 700 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 2 76 0 

50 TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2 82 0 

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 2 88 0 

55 
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TACACGCGGG TCCTGAAATC GCAGTTG CAT CTACAAAAGC ATATACTGCA CAAATTGCAG 3 00 0 

TATT AT CAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG TCGTGAAGCA GATATTGATT 3 06 0 

5 TATTGAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 3120 

TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 318 0 

GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA ACTTAAAGAA ATTTCTTACA 3 24 0 

'° TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG TACAATTGCC TTAATCGAAG 33 0 0 

AAGGTACACC AG TTGTAG GT TTAG CAACAC AAGAGAAAGT TAATTTATCA ATTCGTGGTA 3 36 0 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTTCTATG GAGGGTCTTG 3420 

75 

AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA ATTGTTAACG C CATT AGTAT 34 80 

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAGATGTTG 354 0 

20 ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA ATTCACTTTT TTAGAATCAA 36 00 

TCATGTATTA AAATTAAAGT ATATGGCACC CTTTTAGATT AATCGACTAG AAGGGTGCTT 3660 

TTTTAGGTCG ACTTa GCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 3720 

25 GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TTTTATGAGA 3780 

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 384 0 

GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3 900 

30 AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 3960 

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG TAAAATGCCA GGCAAGGTAG 4 020 

GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 40 80 

35 AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 4140 

GTGTTTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA CTTAAATTAG 4 200 

TTGGAATGTC TGCTAAAGCG CAACAATATA TTGGTTATTT ATCTACCGGT GAAAAACAAC 4260 

GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 4 320 

CAGCTGGTTT AGACTTTATT GCACGAGAAT CGTTGTTAAG TATACTTGAC TCATTGTCAG 43 8 0 

45 ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT TATTGAAGAA ATAACTGCTA 444 0 

ACTTTTC CAA AATTTTACTG CTAAAAGATG GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 4 500 

ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4 56 0 

50 GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 4 62 0 

TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA ATATGGATAA CGTAAAAGCA 4 680 
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AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4 80 0 

TCGCATTCTG AAAATACATC AACTTG 4 82 6 
5 (2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs 

(B) TYPE: nucleic acid 

w (C) STRANDEDNESS : double 

(D) TOPOLOGY -. linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

2 0 TTCGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180 

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 24 0 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300 

25 ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360 

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 4 80 

30 GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 54 0 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

35 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 840 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960 

4S CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 102 0 

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 114 0 

SO GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200 

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 

55 
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TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGAC C AAATAATTTT 1380 

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 1440 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 1500 

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 15 60 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 162 0 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 1680 

TTGACCAAG C CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 174 0 

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CGCACCTGTC GGACCACTTA 1800 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 18 60 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 1920 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 1980 

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 204 0 

ACTGAATATT AGG CATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 2100 

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160 

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 222 0 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 22 80 

ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA CGATATATGC ATCTTTTTCT 23 4 0 

AATGGCTCTA ATTCTGTTAA AGATGTTTCG AAATCATCAT TTAAATCGTC TTTCACAAAT 24 0 0 

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 24 6 0 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2 52 0 

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 2580 

ATTTCATCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 264 0 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 27 0 0 

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 276 0 

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 2820 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 288 0 

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC AT CTATAATA ACTTCACCTG 294 0 

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000 

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3060 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 3180 

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTGAATTGG 324 0 

5 CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300 

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3 3 60 

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 34 2 0 

10 

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 34 8 0 

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3 54 0 

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 36 0 0 

15 

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660 

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 372 0 

2 0 GTAAAGC CAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3 78 0 

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3 84 0 

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3 90 0 

25 GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 396 0 

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4 02 0 

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4 08 0 

30 TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 414 0 

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 420 0 

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260 

35 

AAAAT ACAC C TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 432 0 

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4 38 0 

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 444 0 

40 

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4 50 0 

TTATATTGGC AG TAGTTG AC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 456 0 

4S GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4 62 0 

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4 680 

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4 74 0 

SO AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4 80 0 

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4 84 6 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

s (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



w (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 12 0 

15 AACGCAGAGA CACCTAAGGT TAG AAAAG CT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180 

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 24 0 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 300 

20 GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 36 0 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 42 0 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 48 0 

25 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 54 0 

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTG AAAAG CA AGCTGGACAA 600 

30 GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 66 0 

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 72 0 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780 

35 TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 84 0 

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 90 0 

CTTAATCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 96 0 

40 ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 102 0 

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080 

AGAGAAGGAC AAAAAGG TGA GAAGACAATA ACGACr CCAA CACTAAAAAA TCCATTAACT 114 0 

45 

GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 1200 

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 1260 

SO TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320 

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 13 8 0 

TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 144 0 

55 
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CCAACACTAA AAAATCCATT AACTGGAGAA ATTATTAGTA AAGGTGAATC GAAAGAAGAA 1560 

ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 162 0 

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 16 80 

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 174 0 

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 1800 

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 184 3 



(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 6 0 

25 CTTTT CGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120 

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 180 

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 24 0 

30 

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 300 

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 36 0 

3s CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 420 

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 480 

GGCAGTTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 54 0 

40 ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 60 0 

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 66 0 

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 720 

45 TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 78 0 

GTTCTGTAAG TTAAAAT AC C TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 84 0 

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900 

SO 

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 96 0 

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 10 2 0 

55 
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GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 114 0 

ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 12 0 0 

TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 1260 

GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 13 20 

CTTGAG CTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 1380 

AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 14 4 0 

AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 15 00 

CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 15 SO 

AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 162 0 

GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 16 80 

GTTGCCG CAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTG CAATT AC TTTTTTTTGC 174 0 

ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 180 0 

CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 18 6 0 

CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 192 0 

TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 198 0 

ACAATCCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA 204 0 

AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 210 0 

CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTG CAA 216 0 

CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC 222 0 

CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 228 0 

TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 234 0 

ATTAATAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 24 0 0 

AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 246 0 

ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 252 0 

TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 258 0 

TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 264 0 

TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 270 0 

AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 276 0 

CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 282 0 
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AAGTGCTACG ATAACTAACC CAATCAATCT TTTCATTCTA TCAATTCCTT TCAAAATCTT 294 0 

CACTATATAT CATTAATGTC TACGTATGAA TCTAGCTAGA ACATTCCCTA G CGTTTG AAT 3000 

S CACTTGGACA ATAATGACTA ATACAATAAC GGTAATAATA ATGACCGTCG TATCAAATCT 3 06 0 

TTGATAACCA TACACTAAAG CTAAGTCTCC TATACCACCA CCGCCAACAG CTCCTGCCAT 3120 

CGCCGTACTT CCAATAAGTC CAATAATCGC AGTGGTAATT GCTAATACTA ACGAACCTAA 3180 

'° AGCTTCAGGA ATTAAAAAAT ATCTAATGAT TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 3 24 0 

CGCTTCAATA ATCCCCTCGT CTACTTCCAA TAATGAGTTT TCAACAAGTC TTGCAATGTA 3 3 00 

AGGTGCCACA TATACTGTTA AAGGCACGAT GGCAGCAGTC GTACCAATTG AAGTAC CTAC 3 3 60 

75 

TACTAATTTT GTGAATGGCA CAATCGCAAT TAACAAAATA ATAAATGGTA GTGACCTTAA 34 20 

AATATTGATT AAAGGATTTA AAACTTGATG TATCACTATA TTGGGCCATA TGCCTTGTTT 34 80 

20 TCGAGTAATT ACCAATAAGA CAC CTAATGG AATACCAATC ACTGCTCCTA AAAATAAAGC 3 54 0 

AATAGATACC AT AT AT AG CG TTTCGTACAA TGCTTGTAAT AACTG TGCAC TGTCTAAATC 3600 

AGAACCAAAC ATATGTTAAT GcACCTCCTC AAATTGAATA TTTTTCTCTT TGAAATATTG 3 6 60 

25 ATTTATTGCC GTGTCTTCAA ATTGTTGATC CATATTAAAT CGAAGCCACA TATAACATAC 3 720 

GGTGTTACCT TGTATTTCTG ACATAGATGA AAATAAAATT TTAACCTCTC TGCCACAAAT 3780 

TTGAATCAAG TCATTTATAA TCGGTTGTGT CACCTGAGTT TCCTCGACGA AGATTTTATA 3 84 0 

30 

ATCTTTAAAA TCGCCAACTT GTTCGTCATT CAATCGACGA AT CAATG ATG TACTTGGCTC 3 90 0 

AGTCTGTATA ACTGTAGACA CAAAATTTTG AGCAATCGTC GTTTTAGGAT GACTAAACAC 3 96 0 

CTCTTTAAcA GTTCCTGTTT CAACCACTTT CCCCTTTTCC ATTACAGCAA CACGATTACA 4 02 0 

AATGTCTTTA ATAACGCGCA TTTCATGTGT AAT CATCATA ATTGTAATGC CAAAGGTTTG 4 08 0 

ATTG&CATTC TTTAATAACG TCAATATCGA AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 414 0 

40 TGCTTCATCG CATAGGAGTA TTTTCGGATT AG T AACAAG C GCTCTTGCAA TAGCCACCCT 4 20 0 

TTGCTTCTGC CCACCAGATA ATTCATCAGG AAATTGG TCT TTTTTATCAC TCAATCCTAC 426 0 

AAATTCAAGC ATTTCCGTTA CTCGTTGCTT AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4 32 0 

45 GAGTGGCATT GCTACATTTT TAAATACGGT AGCTGAATTT AATAAATTGA AATGCTGAAA 4 38 0 

TATCATACCG ATATCTTTCT TAATATCCCT CATCATTTTA TCGCTATAAT TCGTAATATC 444 0 

ATGTCCATCT ACAATCACTT GTCCATTCGA GGCAG CTTCA AGATGATTCA CGAGTCTTAC 4 50 0 

SO 

CAACGTACTT TTTCCTGCAC CACTATATCC AATCACACCA AAAATATCAT TGCGATTGAC 4 56 0 

CGTAAATGAT ACGTCCTTCA AAGCATCTAT TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4 62 0 

£5 
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TAAGAGCATT ATATGTAAAA TTGCATATAT 
AATTAAAAAA TAAGTAATTC ATGTGACAAT 
5 CAAATTAAAT CCATAAAAAT GTCCACCAAT 

TCGCACAAAA AAGCACTAGC ATATTCAAGA 
AGTGCTGCTA TTATTTATAA AGTATCTAGT 
GTATCTTCAA TACCTACAGA AATTCTTACA 
ATATCTGCTG GAATGGATGC ATGTGTCATC 
GCACCTAAAC TTTCAGCTAA TGTGTAATAC 

15 

TCTGTATTTT TCACTTCAAA TGCAATCACA 
ACATCATGAT TTAAATGACT TTCAATACTT 

20 GCTTGTAACA TTTTAATAAT TTCAATAACG 

AATGTTTTAA TACCCCTCAC AAGTAAATAG 
GAATTTGAAA TAAATGCTAA ACGTTCTGCA 

25 GCAACGACAT CACTATGTCC ACCTAAATAT 

CCTAAATCTA ATGGATTCTG ATAATAAGGT 
AAACCGTGTT CTTTCGCAAT TTCAGCAGAC 

30 

TTAGAAGGTG TTTCAATAAA CAACATCTTT 
TCTGTATGCG TTGTATCTAC AAAATCCACT 
AATGCGCGAT AAGTACCGCC GTATACATCT 

35 

AACAGCATAA CAACTGCACT GATTGCTGCA 
CCATTTTCTA ATGTCGCAAT AACGCTTTCT 

40 GAATATTCAT ATCCTTGACG TAAATCACCA 

TAAATTGGTG TTGTAACGGC ACCTGTATAA 
TTAGTTTTCT TGTTCATTAT TATTCTCCTC 

45 ATCACTACCA TCTGGAAATA CGACAACAAT 

CAATGCACCT TGTAATGCTG CACCTGaAGA 
CAAACTTTTG ACATTTCGAA AGGCATCTTG 

50 

ACGTCTCTCT AAAAATATCG GCCATTTCTC 
AG CTGGCCCT CCATTTAACA CGGACCCTTC 

55 
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CGTcAATACA ATTTGCCGAA TTTTCTAAAA 4 74 0 

GAC GAATTGT GAGACTACTA TGACATTTAT 4 80 0 

CCTCCACAAC GCAATTACTA AATATTAACA 4 860 

ACAACAAACG TTGAACTCAA AATATATGCC 4 920 

GCTTG TTTTA AATCATCGAC TAAATCTTCA 4 980 

AGTCCGTCTG TAATACCTTC TTTAGCTCGA 5 04 0 

AATGCAGGTA CTGAAATTAA ACTTTCCACT 5100 

GATGTTGCTT TAATCAATTG TTTGGCACTT 5160 

CCTGTATGGC CATCCGCTTG AGCCATATGG 5220 

GGATGGAACA CTTGTTGCAC AGCTGGATGT 5280 

CTGCGATTAA TTTGTTCCAT ACGTAAACCT 534 0 

CTATCTTGAG GTCCTAAAAT GCCACCTGTT 54 00 

AGCTTGTCAT CCGATGTTGC AACTAAACCA 54 6 0 

TTCGTTGCAG AATGTAAGAC AATATCGATA 5520 

GTCATAAATG TGTTATCAAC AACTGAAATC 5580 

TTTTTAATGT CAGTAACACG TAATAATGGA 564 0 

GTTGTTGGGC GTATCGCTTG TACAATTGAA 57 00 

TCAATGCCAA ATCGTGTAAA TACTTTTGTC 57 6 0 

GAATTTAAAA TAATATG AT C TCCTTTGTCC 5820 

ACACCTGAAC TAAATGCAAA GCCATGTTTG 5880 

ACAGAACTTC TTGTTGGATT CGCAGTACGA 5 94 0 

ATATCATCTT GTAAATATGT ACTTGTTTGA 60 00 

TCGTCTGTTG TGTGCCCACC ATGAATTAAT 6 060 

ATAATTAAAT ATTTG CTTAG ACATATATCG 6120 

CGTACCTTCA GATAATTGCG CTTTTAAATT 6180 

ACTGCCTACT AACAACCCTT CATTTATAGC 6240 

ATCTTTAATC GTAAATATCC CATCTACAAG 6 3 00 

AGAACCGATA CCTTCAGTGT CATGTGCATG 63 6 0 

TGGCTCAACG GCATAACATT GCACGTGATG 64 20 
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ATAATCAATT TGCTGTAATG CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 6 54 0 

CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAG CATAGGA 6600 

5 ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6660 

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 6 720 

ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 6 780 

'° CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6 84 0 

ATATTTCCCG AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA G CTTGG CATA 6 9 00 

AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 6 960 

is 

ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAAATGAA TGAAACCTTT 7 020 

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7080 

20 TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTC CAAAC 714 0 

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 72 00 

TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 7260 

25 CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7320 

TAATAATTGT CCTACGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 73 80 

AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 74 4 0 

30 ACCAAAAGAT AAGGTTG CTG GAATACTAAT GATAAATACT AAAATACTAC CG AT CACTGC 75 0 0 

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 756 0 

AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAAT C CCA AATAGAATAA 762 0 

35 

TGTGCCTAGA TGCATTTGAC TAAAGAC CAT TGG CAG TACT TTAAATAATA ATCCAGGCCC 76 8 0 

TTCTS-GTGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 774 0 

40 AGATACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 78 0 0 

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 78 6 0 

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 792 0 

45 AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 7980 

AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 804 0 

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100 

50 

AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160 

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC CAACCACCGA TAACACTATA 8220 
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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 8340 

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 84 00 

AAAGGCACCG CCG CCAT AAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 

TCTTTTCATC ATTCAC 853 6 



(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 240 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 3 00 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 3 60 

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AGTGATGACA TGATAGTTAT CAG TAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 4 80 

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 54 0 

TTTflTAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660 

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAG CTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 780 

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 84 0 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 

TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACT AAT AAA TAATGCATCT TTTTCAGATG 10 80 
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TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 1200 

TAACTGTCGT GTCTTCCATT AATGAC CTAA AGAAATCATT CGCAGAAATC TCATATGAAA 1260 

TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCAC TAAT ATTTGAAATG 13 2 0 

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 13 80 

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 144 0 

CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 15 0 0 

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 156 0 

TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 162 0 

AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 16 8 0 

ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 174 0 

GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 18 00 

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 18 6 0 

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 192 0 

TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 198 0 

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 204 0 

CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100 

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 216 0 

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 222 0 

TCGACTCCTA ATG CATTC AT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 22 8 0 

TCTGGATCAG AATCTTTAAC AATG CCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 2 34 0 

ATAAGCATCG AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 24 00 

ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 24 6 0 

AATTCTTTTG GATAGCCACC TAAAG CAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 252 0 

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AG TG AT AT AA ATGATCATTT 2580 

TTTAGAATTT TAGGCG TCTT AT CATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 264 0 

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 270 0 

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 276 0 

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2820 

ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTTAGA ATATGCTTGC 2880 
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ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3 0 00 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3 06 0 

5 GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 312 0 

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 318 0 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 324 0 

AACCCTCCAA CAACTCGAAG ATGATGTTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 3 3 00 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 33 6 0 

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 3420 

15 

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 34 80 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 3 54 0 

20 TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600 

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 366 0 

TTTTCACGTC AATTGACCTT TATG CTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3 72 0 

25 TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780 

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGCTT CCGTAGTACC CGTTTTAGTT 384 0 

GGTACAGCAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3 900 

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3 960 

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4 020 

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4 080 

35 

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 4140 

TTAGlATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4200 

40 CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 426 0 

TTTATTCAAA CTGG CAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 432 0 

ACTATCGG 4328 

45 (2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

so (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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GTTCAATACA GAAAAAATAA ATTTAGATGT TGAAGCATCC TACAATTAAT ACAGATCCAT 60 

TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 120 

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180 

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAAT CTAGTT TAATTGACGC 24 0 

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300 

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 360 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAAC CACAAG CAATCGTAGT 420 

TGCACCAACA AGAGAACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480 

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 54 0 

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600 

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660 

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780 

ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 840 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960 

TTTAGCACGT TCACTAAATG AAG CTGGTAT TAAAGTTGGT ATGATTCATG GTGG CTTAAC 102 0 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080 

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 114 0 

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200 

TTATTrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260 

AGAASATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 132 0 

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GAC CATTTAA CTAATCAAGT 13 80 

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 1440 

TAAACAAGAA 1450 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 60 

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 120 

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180 

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 24 0 

AAGGTTATTA TGTTGTTCTA G CATCTGGT A GACCTACTGA AGGTATGATT CCAACTGCTA 3 00 

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 3 60 

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 4 20 

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 4 80 

TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 54 0 

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA TGTACCCAAG GTCATGGGTG 600 

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA 6 60 

ATGTAGATGC TACGACAAGT AAG C CATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720 

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 7 80 

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 840 

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 9 00 

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 9 60 

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 10 20 

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 10 80 

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139 
(2) INFORMATION FOR SEQ ID NO: 282: 

" (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 

AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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AATGAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTA ACAAACAAAA TATACAAGAT 30 0 

GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 3 60 

5 GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 42 0 

GAACAAACAG CACGGA CCAT TAAATCATAT CAACAACCAG TAACAATTGC AGAAATTCAA 4 60 

ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 54 0 

10 

GACATAGAAG ATTGAGATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AAT CATGTAA 600 

GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 6 60 

^ TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 720 

AAGCTTATGA CTTAGG CATA CGTGC CATTA TGTTTTTCGG TGTTC CAAAC TCAAAaGATG 780 

ATATAGGTAC TGGTG CAT AC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 840 

20 AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900 

ATGGTCATTG TGG CGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960 

CACTACTTGT TAAAACAGCA ATTTCT CAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 1020 

25 GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080 

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 114 0 

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGAC CCTG 1200 

30 CTAACCGTTT GGAAGCACTT CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGACATGA 12 60 

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATACGA 13 20 

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGG CAC 13 80 

35 

AAAATGGTTG GATAGATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 14 4 0 

CAGGTGCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 1500 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 1560 

40 

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 1680 

45 AAAAATTTAT GATATCGATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 1740 

TATTTTAGGA C AT AG AG AC C CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 1800 

TACAAG TTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 1860 

50 AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGGATAC 192 0 

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTG AAATTT G AAGGTTGCTA 198 0 
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GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100 

TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AACGATATTG CTGGTGTAAT 216 0 

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220 

ATTAAGAGAT ATTACGACTG AATACGG CG C ATTGCTAATT TTCGATGAAG TAATGACTGG 2280 

TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 23 4 0 

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTG CA TTTGGTGGTA AAAAAGAAAT 24 00 

CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTTAT CAGGAAATCC 24 60 

TCTTGCAATG ACAAGTGGTT ATGAAACGTT AAGCCAATTA ACGCCAGAGA CATATGAGTA 2520 

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580 

TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2 64 0 

TGTAACTAAT TTTGAACAAG CG AATAAAAG TGATTTGAAA TTATTTGCAG AAATGTAT CG 2700 

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCTTATC 2 760 

TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820 

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2 880 

TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2 931 
(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

r(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60 

TATGATACG C ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA G G CAAATG AT 120 

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180 

TTACCTATG C ATAATAGAAA AGACATGATG GTTAATGGCG GTGTACTTAT GGCACATTTA 24 0 

AATGCCAAAA GTGGTC CGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AG CGATTGTA 300 

ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 36 0 

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 42 0 

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 4 80 
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CCCAGATATT TGGTATCAAG GTATAATAGA CCAATATACA AAAAGTTCTG CTTTGTTTGA 600 

TTTTAGTGAA GTATACGATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTGT 66 0 

TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 720 

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 78 0 

TAACGTTGCA TTCT CAATGA TATCTAAATT TAATTTATTT ATTG CATT AG GGATAAGAGA 84 0 

TGC GATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 90 0 

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960 

AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020 

TATTAGACAT AG AG CAACAA GTATACAATT ACATGATAAA AATAAATTAG ATAGATATCA 1080 

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 114 0 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT A TTTGG AATA GAACGTTGCt 1200 

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 1260 

TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGATTT 132 0 

TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 1380 

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 14 21 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

-(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT G CnAACG ATT GAACAAGAAT 60 

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGG TAATG 12 0 

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180 

ATAAACAATG GTTAT CTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 24 0 

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 300 

ATG CTATAAT CATATG TATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 360 

GGCG CCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420 

TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 4 80 
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CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 600 

GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 660 

5 

GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 720 

TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 780 

AAATGGTCAA T CATATG TAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 84 0 

10 

GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG 90 0 

GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 96 0 

15 TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 1020 

AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 1080 

GACAACAGAA G GACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 114 0 

20 AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 12 00 

TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG 12 60 

TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 1320 

25 

TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 1380 

AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 14 4 0 

30 TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 15 00 

TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 156 0 

ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 1620 

35 CTTATTTGAG CGTGTACCAG AAGTT CTAG A ATATTAT CAG AACAAATTCC AATATATTCA 16 80 

TGTAGATGAG TATCAAGATA CTAATAAAG C ACAATACACA TTAGTTAAAT TATTAGCAAG 1740 

TAAGTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 1800 

40 

TG CTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 186 0 

TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 1920 

AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 1980 

45 

TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 2040 

AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 2100 

50 CAAT CACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 2160 

GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 2202 
(2) INFORMATION FOR SEQ ID NO: 285: 

55 



1109 



EP0 786 519 A2 



(A) LENGTH : 785 base pair: 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 285: 

AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG cAAGATGATG GCACGATAAA 60 

ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG 120 

AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA 180 

ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT 24 0 

GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA 3 00 

CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG 360 

CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT 4 20 

CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA 4 80 

AG CATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA 54 0 

C CAATAT AAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA 600 

CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT 6 60 

AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA 720 

TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT 7 80 

TCTTC 7 8 5 

(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 
- (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 60 

AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 12 0 

TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 18 0 

ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 24 0 

GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 300 
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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 4 20 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 4 80 

5 TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 54 0 

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 60 0 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

W 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

j5 CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 

(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 6 0 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AG CATGAAAA TTGTAATCAA ACCATAAATT 120 

30 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180 

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 24 0 

35 GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 3 00 

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 3 60 

AAATTCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420 

40 CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGT CAT TACATTAAGT 480 

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGG CGTTG TTATGGAAGA ACTTCGCTTA 54 0 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

45 AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAG CATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 7 80 

so 

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 84 0 

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900 

55 
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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 102 0 

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 10 80 

5 AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 114 0 

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200 

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 1260 

w 

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGG TG AT TTTC CAG TAG 1320 

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 1380 

j5 GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT G CTGGATATG 144 0 

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 1500 

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 1560 

20 ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 1620 

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTC CAAG 1680 

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732 

25 (2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2779 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

3S (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTT CAAATGA 60 

AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 120 

40 GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 180 

ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCT CGA ATTTCAGCAG TACAAATTGT 24 0 

AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GC CATTTAAC 3 00 

45 

AATTGGTTAT GGAACTGCAA AT CATC CATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 360 

ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 42 0 

TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 4 80 

50 

GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAGAGA 54 0 

TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 60 0 

£5 
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AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720 

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 7 80 

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CG CAAAGATA AAGAGGAAGA 840 

TGTTGAAACA ACAACTAGTA TTGAAT CTAA AGATGAGGAC GTTCCTTTAT TATTGG CTAA 9 00 

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 9 60 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 10 20 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 10 80 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 12 60 

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 13 20 

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 13 8 0 

TATGTATAAT GCTGCTCATC TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 14 4 0 

cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC IS 00 

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAAT CCATT AAAAAACACG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 16 20 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 16 8 0 

TTCTATGCTC TACGATGTTA TATTGG CAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 174 0 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 18 00 

GG CAATGCG A GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860 

AATGTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 192 0 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 19 80 

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 2 04 0 

TCTTACCTTT ACGATAAAAT GATTTAGCAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100 

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160 

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 222 0 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 22 80 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 234 0 

CGTGGCACAA CAT CAAAG AT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 2400 
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CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 2 52 0 

TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 258 0 

TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 264 0 

TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTG CAATAAA CGTAACTAGT 2 70 0 

CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 276 0 

GAAACCGCTC AATAAATAA 2 77 9 
(2) INFORMATION FOR SEQ ID NO: 2 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 9: 

ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTG CTTAAAA 60 

TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA 120 

TGTTTTCCTG TCAACAATCC AAAATCACTA TCATTTTTTA TAGGTAATAC TTTTTTAAAT 180 

TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG 240 

TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT 3 00 

ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT 3 60 

TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAG CATTTG A 4 20 

GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA 4 80 

TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT 54 0 

ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA 600 

ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA 6 60 

TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA 72 0 

TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT 780 

GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG 840 

GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT 900 

TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA 960 

TAATTC CCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA 1020 
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CGCTGGGGCC CAATTTAATT 
TTCAGTTAGA GTTGGTAACT 
5 TCCAAAATCA TATTCATGTT 

CATTTCTTCT CTAATTTCTC 
ACnGCCAGGA AATTCCCACA 

w 

AATTTTGTTA TCAGAAAAAA 
CACCCTTCAA TTTAAAATCA 
AAATCAATAT TAAAAATTAA 

15 

ATACACATCT ATTATATAAT 

TTTGGGGCGT ATTTCTTTAT 
20 TATTAAGTCT AACTGAGATT 

TAGAACGAGc TAATGATAGT 

AGC GAGAAGG TTTTGAAAGT 
25 GTCTTGGTGA AGGTCGACTT 

GTTATTTAAA TGCCCAAACA 

TTTCcAACTG AATTCGCCC 

30 

(2) INFORMATION FOR SEQ ID NO: 2 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120 

45 ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 3 00 

SO 

CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT TTTTCGCAAT ATCTTGCTAA 3 60 

ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC T AC CT CT AAA 420 

ss 



TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 114 0 

CTTTGTTTAA AGTACATTTG TATGTTGTTA AC CTAACAAT 12 0 0 

CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 126 0 

TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320 

TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380 

TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 144 0 

TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 

TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 

ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620 

AATTCATTAC ATAAATG TAA GGGCTTTAGT TTT CAT GTTT 1680 

TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740 

TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 1860 

AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920 

AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980 

1999 
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TTGTTTGATC TGTGTCGCAT AAG CTTCATC AAATAGTACA TCAGTACGTG GAACACCTGT 54 0 

AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA TATCGGTTTC 6 00 

AGATGATACA T AAG CTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC G60 

CTTTTTACCA GTACGACTAA AGCCAACTGT TTTAAAGGCA CCAACGGCAT GCCATACTTG 72 0 

AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA AGTCATCAAC 780 

AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA TGATGCCACG 84 0 

TCTATCTGTA ATATTCG CTT TAAAAACAGT GTGAATATCA TACTTTTTAT CTAAATTTTG 9 00 

ACGTAACATT TCGTTATAGA TGTATTCAAA GTTTCCAGAC ATCGTTGGTC TAGAGTCTGA 9S0 

TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT TAAATATCGC 1020 

TTTAAAAATA AATTGT CTTG TATTAAATGA TTGTTTGCGG AAATACTTAC GTAATTCTTT 1080 

ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA CAT CAAGG AC 1140 

AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT TCTTCGAAAT 12 00 

GCCGCCTTTT CTAAATTCTT TTAGG TAAGT CTGCAATAAG AAATTGATTT TACCATTTTG 12 SO 

TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT ATGCATCATT 13 20 

AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 138 0 

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 1440 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA CGAATAACTG 156 0 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 1620 

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 1680 

TTCAAAAGCT TCTGTAAGAT CATT AATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 174 0 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800 

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 1860 

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 1920 

GTAATCGATC ATT 193 3 
(2) INFORMATION FOR SEQ ID NO: 291: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

nGTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 6 0 

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 12 0 

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 180 

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 24 0 

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 30 0 

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360 

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGG CGAAG AAAAATTTGT 42 0 

GAAAAAGTTA T ACG CAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 4 80 

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 54 0 

TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600 

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660 

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720 

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 78 0 

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 84 0 

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 90 0 

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960 

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020 

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080 

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 1140 

CCCTAGAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200 

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 126 0 

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 1380 

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440 

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 1560 

AG AATAAAT G CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 162 0 

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680 
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GTCTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 18 00 

GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 186 0 

AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 192 0 

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC At TATATTTT 198 0 

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 2 04 0 

TAGATAATC 204 9 
(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 60 

TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120 

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 180 

GAATCGTCCA ATCATTTCCA GTAC CAAT AT GATATAAATC TGAAAATGAG TCTTGATTGA 24 0 

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 3 00 

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 36 0 

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 420 

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAAT AT C AATGCTTCCT GTAACAGAAT 480 

CCTCATTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 540 

G AC CGAGAC G TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 60 0 

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660 

CAATTTCGAT AACTTTTTT A ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 720 

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 780 

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 84 0 

ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATG CG TTAAGTTTTC 900 

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 94 2 
(2) INFORMATION FOR SEQ ID NO: 293: 
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(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 293: 

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 6 0 

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 12 0 

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180 

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 24 0 

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 3 60 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 42 0 

TTTCATGTG C ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 48 0 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT TGTCGAATGT 54 0 

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600 

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 66 0 

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 72 0 

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 7 80 

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 84 0 

AAG CTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900 

GTGCATAACT TTCAAGACAT GGTGTTG CAC TTCCTAAAAT GACTGGACAG TGATGATATT 960 

CAcTTCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 102 0 

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 108 0 

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 114 0 

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200 

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260 

TTCCTTGG 126 8 
(2) INFORMATION FOR SEQ ID NO: 2 94: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

5 TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 6 0 

CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 120 

CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 18 0 

W AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 24 0 

T CAAAGG AT A GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 3 00 

CACTGATAGA TTCATAACCT T CATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 3S0 

15 

ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 42 0 

TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 4 80 

ZQ GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 54 0 

ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCCaTATAcC TCCACTAtGT CTAAAGakGT 60 0 

TkGCTAAACG CGTTGtCGTC GATGATTAA 629 

25 (2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

35 

TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 60 

AGATAATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA 12 0 

4Q TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 180 

AATTATAGAT GCATTTGGTA TG AG AC C TT A TATTAAAAAG AAAGTTAAGA AATATT CAAT 240 

GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 300 

45 CTTGGATGAG C CT ACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 360 

TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCt AATA TCAAGTCATA AGTTAGAAGA 420 

TATTGAATTA ATTTGTGATA GAG CTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 480 

50 AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 54 0 

TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGT CG ACAAAGCAGA 600 

55 
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ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 72 0 

CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 7 80 

5 TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 84 0 

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 900 

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 96 0 

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 1020 

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 1080 

J5 GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140 

GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 12 60 

20 GTATTTTTAG TACTCATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 13 20 

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 13 80 

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 144 0 

25 ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 1500 

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAG C GTCAGTTTTA 1620 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 168 0 

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 174 0 

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

35 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 186 0 

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

40 GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980 

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 2 04 0 

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

45 GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 216 0 

AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2 220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 22 8 0 

SO 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 234 0 

TTGGTTTATA TCAATATCGT CATACGAAAA CTGTGGATTT GTCTAAT CTT GAAATAAACG 24 0 0 

55 
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TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 2 520 

AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2 580 

TTGACGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 2640 

AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2700 

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2760 

AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2 817 
(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

TCTGTTAAAA TGA TTTTT CT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 60 

TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATG AGAAAAACTT GGAGGTGCTC 120 

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 180 

CGGATTTCTT GG c AGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 24 0 

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 3 00 

TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA GTTGCACTAT 3 60 

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 420 

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 480 

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG 540 

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AG AAACATT C GAAAGATTAT 600 

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATGTC 660 

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720 

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 78 0 

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 84 0 

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900 

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 960 

AATTATT CGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 1020 
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TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 114 0 

GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 12 00 

AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 12 60 

GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 13 20 

CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 13 80 

TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 14 40 

TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 15S0 

AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 297: 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: 

TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60 

30 GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120 

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180 

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACTTTTTCAT CAATCAACCA 240 

35 

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 3 00 

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 360 

4o AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAATTTCATT 4 20 

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TAT C AAATG A ATACCATCAA 4 80 

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 54 0 

45 CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600 

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660 

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720 

50 TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 7 80 

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840 

£5 



TAAGTGGTAT TACAGATTTA 
TGAAAATCTG TACAGCTTAT 
TAGATCAATT AAAACGTTGT 
TAACAAATGT GCGTACTTTA 
TTTCAGAATT ATGTAATGTA 
CAAACCTATT AAAAGAATTG 
ACATGAGCCT TCTATCTTTA 
CGATTATTTA GATAAAAAAT 
rTTAgATGAt AtTtAACGAA 
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TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA AT GG CAATTA 960 

AAATTGCACT AATGACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA ACGCCCATAT 10 2 0 

5 TTTTCATCCT AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 10 80 

CGAAAAAGAT CACTAAATAA TTTAGTGAAT AT AT AAA CAT TTGTGTTGCC CATTTTGTTT 114 0 

GATCTGAATT TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA TTTAATAAGG 12 0 0 

TrGCTAACAC TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 12 60 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

)5 TTGGAATATT TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA TAAAAATGAA 13 8 0 

TTGGtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTCCTT 1440 

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGTGT 15 00 

20 TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 15 6 0 

TAAGACCAAG TACG CCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 16 2 0 

TCATAC CAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 168 0 

25 TATTTTGTTT ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA TTTAACGCAC 174 0 

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA GGTATTGATG 18 00 

ATAAGAAGGA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 186 0 

30 AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 192 0 

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 19 80 

TAAGATTATG ATATCTTAGA TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 204 0 

35 

AAATTAACGA CAAGAGAAAA TGTCAATTTT GTGACACAAA TAACATTTAA TTTATTG CTA 2100 

TAATGTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC AT CTAAAAGA GATTAATATT 216 0 

4Q TAAGCTTCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 222 0 

ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 22 80 

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 234 0 

45 GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA TGGGGTGTAT AAATTGTTTG 24 0 0 

GCAAAAAGAA TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA TTTGTACAAC 24 6 0 

TTGGTGGAGC CTTAGTTACC AAAACCGGAT CAGCTGATGG TTGTGGTTCT TCTTGGCCAC 2520 

50 TATGTCATGG TGCGTTGATT CCAGAATTCT TTCCTATTGA TACGATTATT GAGTTAAGTC 2580 

ATAGAGCCGT TTCAGCTTTG T CTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 2 64 0 

55 
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TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 276 0 

cATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820 

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2880 

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2 94 0 

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000 

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3 055 
(2) INFORMATION FOR SEQ ID NO: 2 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 

TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60 

AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 120 

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180 

ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATGATAAG 240 

TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 3 00 

TCTACACTTT TTAATTTTAT AAT AG CGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360 

AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420 

AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480 

ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT TTAAGTTGAA 540 

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600 

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660 

TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720 

GAATGGTCAA TACTCAATTA TGGAnTTC 74 8 
(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA AtTTAAACAG GCGAAGTCTT 60 

5 CAATAAGTGA AGATGTC CAA ATTATAAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 120 

TAATTACCAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 180 

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 24 0 

10 

TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 3 00 

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATTGCGAC 3 60 

js AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AATTTACCAG TAGTAGTGAT 420 

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC T ACAGTTT CA ATTAATTACG TTTCAGGATC 4 80 

TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 54 0 

20 TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 600 

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 66 0 

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 72 0 

25 T CaAG AGTTT AACGTAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA AGGAGTTTTA 780 

GTATTAT GAA AATCATTAAC ACAACAAGAT TACCGGAAGC ACTTGGACCA TATTCGCATG 84 0 

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 900 

30 

ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 960 

TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCGAA AGCGACCATT TTCATTAAAG 102 0 

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGG TCA ATATTTTAAT GAACACAAGC 1080 

35 

CAGCGCGTAG TTGTGTAGAG GTTGCGCGTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 114 0 

TAGTAAGTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 12 00 

40 AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 1260 

TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 1320 

TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG TTGCAATGCC 13 80 

45 AAGTAAACGT ACAC CAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 144 0 

GAGACAAGAA ATTCAAGATG CAGT GATGAA AGTATATGAT GAAACAGATG AAGTAGTACC 15 00 

AGATAAAAAC GCTACATCAG AAGATTCAGA AGAAGCTTAA TCAATTTTAT ATTTAGCGAT 1560 

50 GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 1620 

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 1680 
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TGaTGCTCGT ATTTTTGAAG TAAGAAAAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 1800 

CAATGCCTTT TATATGTTGA AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA I86 0 

5 AATTTAATGG GGGTTAATGT TCATGCGAAG ACACG CGATA ATTTTGG CAG CAGGTAAAGG 1920 

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 1980 

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 204 0 

ACATGGTGCT GAAAGTGTAA AAGGACATTT AGGCGAGCGT TCTTTATACA GTTTTCAAGA 2100 

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220 

15 

TTGATTGCGC ATCACGAGGA TGCTAATGCT CAAGCAACTG TATTATCTGC ATCGATTCAA 22 80 

CAACCATATG GATACGGAAG AATCGTTCGA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 234 0 

zo GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GATATTAATG AAATTAGTTC AGGTA TTTTT 24 00 

GCGTTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 2460 

GGTGAATATT ACCTCCCTGa TGTATTGTCG TTAATTTTAA ATGATGGCGG CATCGTAGAA 2520 

25 GTCTATCGTA C CAATG ATGT TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAGT 2580 

CAGGCTGAGA AGGCGATGCA ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 264 0 

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 27 00 

30 ATTGAACCAG G CGTACGAAT TAATGGTCGT ACAGAAATTG GCGAAGATGT TGTTATTGGT 2760 

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820 

GTTAATGATG CTAGCGTAGG AGCGAATACT AAGGTCGGAC CGTTTGCGCA ATTGAGACCA 288 0 

35 

GGCGCGCAAT TAGGTGCAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAG CAG AT 2940 

CTTAAAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG GCGATGCTGT AATTGGCGAA 3 0 00 

4o CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060 

AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120 

ATTGGTGATG ATGTATTGGT GGCAGCTGGT TCCACAATCA CAGATGACGT ACCAAATGAC 3180 

45 AGTTTAGCTG TGGCAAGAGC AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240 

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3 3 00 

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360 

50 AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420 

TTTCATTGAA AGGAAACGAA GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 3480 
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GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTGTG AATCTACATT 3 600 

TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGCA ACAATCAATA 3 660 

TTGTAGTGCC AT ATT AT GG A TATG CAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 3720 

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGctGG CGCAACTCGT ATGATTGCGT 378 0 

TAGACTTACA TGCACCACAA ATTCAAGGAT T GTTTG AT AT TCCAATTGAC CACTTAATGG 3 84 0 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3 900 

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG ACATTTTAAA 3 960 

AACTCCAATT G CAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 402 0 

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 40 80 

AGGTACAATC ACTTTAG CTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 414 0 

TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CGTATCGAAA ATTCTGCTAT 4200 

AAAAGAATTA ATCGTAACAA ACT CAATTCA TTTAGATGAA GATCGCAAAC CATCTAACAC 4 260 

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 43 20 

atcagttagc gtattatttg actaatattt aaaaggcgtt tgacgaacat attccaaacg 43 80 

tgtataatag tttcgttcgt gattatacga ataaataaac acttgcaagc aacgatgatg 444 0 

ttgatgggta agtgaggtgc tcgttttgag caaaaatgaa aggtggaaat gagaatggct 4 500 

tcattaaagt caatcatccg tcaaggtaaa caaacacgtt cagatcttaa acaattaaga 4 560 

aaatctggta aagtaccagc agtagtatac ggttacggta ctaaaaacgt gtcagttaaa 4 62 0 

gttgatgaag tagaattcat caaagttatc cgtgaagtag gtcgtaacgg tgttatcgaa 4 6 80 

ttaggcgttg gttctaaaac tatcaaagtt atggttgc 4 718 
(2) Information for seq id no: 300 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 60 

TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 12 0 

ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTAC TATTTA TTACGTAAAA ACTTTTCGTT 180 
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TGCTGTTTCC ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 300 

CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 36 0 

5 GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATT CTTAAA 42 0 

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480 

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 54 0 

to 

TATTATGGCA CCTATTGCTG CTTGGGGTAT TACAACAACT GCATTT AT CA ACTTTGGTTA 600 

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 6 60 

AATTTCATAC GTATTGATTA GAGACACACC TCAATCTCAA GGTTTACCTC CAATCGAAAT 720 

15 

TTATAAAAAT GACTTTGCTA CAAGCGATAA GAAAACATTA GAAACAGAAT TAACTACAAA 780 

AGAAATTTTA TTTAAATATG TACTGAACAA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 840 

2g TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900 

AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 960 

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020 

25 ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT CACAGTATTT GTATTAATTT ATTGGTTAAA 1080 

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140 

TGGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 120 0 

30 CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGCCAACAT 1260 

CGTCTTAGGT GCTGTAGTTG ATAAATTCGG ATGGGATGTC GGTTTTATTT TATTAACAGC 1320 

AATTAGTGTG TTTGCAATGT TGAG CTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 138 0 

35 

CGTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATGATATA 1440 

TTCATCTCTT ATAAGTTCAA CCATATGACT TTTTATTAGT ATTCAAAAAA ATATTTACAT 1500 

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 1560 

CTAAAAGCCA TTTTCCCTTG ATACTTAGAT GAAGCTAAGA TAC CTTTTT C TACCAATAAT 1620 

TCCCTAGGAA ATAGAAAGTA GCCATTTAAT TCATCATCGA TAACAGCAAC AATCAGGTAA 1680 

4S TCAGCAAACT CTTCAATTTT GTATGG CCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 174 0 

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800 

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860 

50 TTAACTTCTG ATTCTTCACT AAAAATATTT TTCAACAGTA TTTTAGATTT ACACATATGC 192 0 

CAATCTCACT TTATTTTTTC ATAATCGTAT CATATATTTA TTTT T TTCGA AAAATACACT 1980 
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TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAG CACA 2100 

AAGTG ACT CA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAG CATT CAT 2160 

AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAG CAC CACG 2 22 0 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 2280 

TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 2 34 0 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 24 0 0 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 24 6 0 

AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 

cLAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2 580 

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2640 

AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 2700 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 27S0 

ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 2820 

AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 28 80 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2 940 

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 3000 

TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 3060 

TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 312 0 

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180 

T 3181 
(2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 40 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 01: 

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTG CCAA 120 

AGAGCAACTA ATTACTG CAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 
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TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATGATTG TATTATAGAA TTAAAATTGA 300 

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 3 60 

5 TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTG TTCT 420 

CTATTACCAC ACGTTTGATT AATCACAGAC AATATTTCAT CTACACGGTC AT CATTGACA 480 

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTGTTGC CAATTTTGTT 54 0 

10 

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600 

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 660 

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTG CA 72 0 

J5 

TTAACGCTTT TGAACCGTTG TG ATT CATT A TGAATGATTT CTTGGTAACC TT CAATT ACT 780 

TTTTCGTGAA cTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 84 0 

20 TACGTTCGCG ACCTACTT CA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAAT C 900 

C ATTTATTG C AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 96 0 

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020 

25 AAG CTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 10 80 

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140 

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 12 00 

30 ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 12 60 

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320 

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 13 80 

35 

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTCGC GCCAGAACTT TACCTTTgCC 1440 

TTTTcAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500 

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATTTTTT CGGTAAAATC ATATCTTCAA 1560 

TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620 

CTAATG CTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGCATTCATA AACCAATTTT 16 80 

4S GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 174 0 

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800 

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860 

SO AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 192 0 

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980 

£5 
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CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 2100 

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

5 GTTTGTGACC GTCATTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 222 0 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 2 2 80 

CAGATTTATG TACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 23 4 0 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 24 00 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 24 6 0 

AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 2 520 

15 

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2580 

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 264 0 

20 GTTACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACTT 27 00 

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2760 

TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2B2 0 

25 GCTGTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 2 88 0 

AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2 94 0 

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 3 00 0 

30 AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 3060 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 312 0 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 318 0 

35 

AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 3 24 0 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC TTCTTTCCTC 3 30 0 

TCCTTCGGCT CTCGCTTACT CATTTAGCTC TACT AAA CTC GTTGCGCTCT TTTCTCGTTT 3360 

40 

CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 3420 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 34 8 0 

4S TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 3540 

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 3600 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTCAGATTCA AACGTTTTCA 3660 

SO CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720 

AAGAGACCTT GCGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 3780 

55 
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CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3900 
TTCAAAACTA GATAGTAAGT g AAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3 96 0 

5 AAGTCTTCGA TCGATTAGTG ATTCGTG CAG CTCCACATGT GCACCATGCT TGCCACCTCG 4020 

GAACCTATT 402 9 

(2) INFORMATION FOR SEQ ID NO: 302 : 

to 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 



20 


GATGCAATAG 


TTGAGTAGTT 


AT AAGTAG CT 




GCTCAAATAT 


AGGTTCAAGA 












CAGAGTGCAA 


ACGCTTGACT 


CAATAACGAA 












ACAGGaGGTG 


CAAAGTATGT 


TTG CTATT AT 












AGAAGGTCAA 


GAAATCTTCG 


TTGAAAAATT 






AGACGTAAAC 


GAAGGAGATA 


CTTTTACATT 


TGATAAAGTA 


TTATTTGTAG 


GTGGAGATTC 


300 




AGTTAAAGTT 


GGAGCGCCAA 


CAGTTGAAGG 


TGCAACAGTT 


ACTGCTACTG 


TTAATAAACA 


360 


30 


AGGTCGCGGT 


AAAAAAATCA 


CTGTATTCAC 


ATACAAACGT 


CGTAAAAATT 


CAAAACGTAA 


420 




AAAAGGCCAT 


CGTCAACCAT 


ACACTAAATT 


AACAATCGAT 


AAAATCAACG 


CGTAATTATT 


480 




ATGATTACTG 


TTGATATTAC 


AGTTAATGAT 


GAAGGCAAAG 


TAACAGACGT 


TATTATGGAT 


540 


35 


GGCCATGCTG 


aCCATGGTGa 


ATATGGTCAT 


GATATCGTTT 


GTGCTGGAGC TTCAg CTGTA 


600 




TTGTTTGGTA 


GTGTTAATG C 


GATTATAGGA 


TTGACATCTG 


AGAGACCAGA 


TATCAATTAT 


660 


40 


GACGACAATG 


GTGGTCATTT 


TCATATAAGA 


AGCGTTGATA 


CAAACAACGA 


TGAAG CGCAA 


720 




CTAATTCTTC 


AAACAATGCT 


TGTGTCTTTA 


CAAACTATTG 


AAGAAGAATA 


TAATGAGAAT 


780 




ATTAGATTAA 


ATTATAAGTG 


AGGTGCATTC 


CGATGTTAAA 


ATTAAACTTA 


CAATTCTTCG 


840 


45 


CATCTAAAAA 


AGGGGTAAGT 


TCTACAAAAA 


ACGGACGTGA 


CTCTGAATCA 


AAACGCTTAG 


900 




GTGCTAAACG 


TGCTGACGGT 


CAATTCGTAA 


CAGGTGGTTC 


AATTTTATAT 


CGCCAACGTG 


960 




GTACTAAAAT 


TTACCCTGGT 


GAAAATGTAG 




CGATGATACA 


TTATTCGCTA 


1020 


SO 


AAATCGACGG 


CGTTGTTAAA 


TTCGAACGTA 


AAGGTCGCGA 


CAAAAAACAA 


GTTTCTGTAT 


1080 




ATGCAGTAGC 


TGAATAATTT 


TGTCTAGTTA 


ACACCAGAAG 


TGAATCTTCT 


GGTGTT tTTT 


1140 
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AGACGTTATA CTAAATGTGC AcTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 12 6 0 

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 13 20 

5 

TAAGTATTTT AAAGCAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 13 BO 

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 14 4 0 

GGTGAGATAT ATGTTTGTCG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA 15 00 

w 

TGGTATTACC G CATACAGAA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 156 0 

CGGTGGTAAA GGTGCTTCAG TCGTATTTGA AGTGGATGAA GGTTTAAGAA CGTTATTAGA 16 2 0 

, s TTTTAGATAT CAACGTCATT TTAAAGCAAG CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680 

GCATGGTAAA AATGCGGAAG ATTTAGTATT AAAAGTTCCA CCTGGTACAA TTATTAAAAA 1740 

TGTTGAAACA GACGAAgTGT TAGCAGATCT TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800 

20 GAAGGGCGGT CGAGGTGGCC GAGGTAATTC ACGTTTTGCA ACACCTAGAA ACCCTGCACC 186 0 

TGACTTCAGT GAAAAAGGTG AACCAGGTGA GGAATTAGAT GTATCTTTAG AATTGAAATT 1920 

ATTAGCTGAT GTAGGATTAG TAGGTTTCCC TAGTGTGGGT AAATCGACTT TATTATCTAT 1980 

25 CGTTTCAAAA GCTAAGCCTA AAATTGGGGC ATATCATTTT ACAACGATTA AACCAAATCT 2 04 0 

AGGTGTTGTT TCAACGCCTG ATCAACGTAG TTTTGTTATG GCAGATTTAC CAGGTTTAAT 2100 

TGAAGGTGCA TCTGATGGCG TTGGATTAGG ACATCAATTT TTAAGACATG TAGAGAGAAC 2160 

30 

AAAAGTTATT GTTCACATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 222 0 

TTATAAAGTC ATTAATCAAG AATTAGCTGC GTACGAGCAA CGTTTAGAAG ATAGACCTCA 2 2 80 

AATCGTAGTA GCTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2 34 0 

35 

AGAAGAAATT GGCGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 24 0 0 

TGATCAATTA TTATATG CAA TAG CAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 24 6 0 

4Q AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 25 20 

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2 5 80 

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCG 2640 

45 TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATGGTGA 2 7 00 

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760 

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2 8 20 

50 TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2 8 80 

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 2 94 0 

ss 
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TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3 0 60 

GATTCATCAA AGTATT CCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180 

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3 24 0 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GG TGTTGGTT 3 3 00 

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360 

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 34 8 0 

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540 

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 36 00 

ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3 66 0 

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAG CGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 378 0 

CAGTTGATGA AGCTGTTAAG GCAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 384 0 

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3 900 

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3 960 

ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4 020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATCTAA TATCATTGCC 40 80 

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 414 0 

TTGG CTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACC CTGCAA TGGAAGATTT CTTTTTAGAT 4260 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320 

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380 

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAAT CAT TATTAGAACA 4440 

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4 500 

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4 560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4 6 80 

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 4 74 0 
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GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 4 360 

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4 92 0 

£ 

TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4 98 0 

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 504 0 

CGATACGCGA GTAATGCCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100 

10 

TGAAATGTTA ATGTtAACTC AAATTGAAGG TAATGATTGG GAAGTCTTAC TGAAACCAGC 5160 

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAG CTG AATG 5220 

, 5 CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTTA CATTATGAAG GTATTTTACA 52 80 

AGAAAGATTA GATGAATTAG GGGAAATGCC ACTGCCACCA TACAT CAAAG AACGTTTAGA 5 34 0 

TGATCCAGAT CGTTATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAG CGG CAGCACCAAC 54 00 

20 AG CAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 5460 

CGCATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520 

GAATGACCAC GAAATGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 5 5 80 

25 AAATGATACT AAGcCAAAGG ACATCGCATT ATATCAGTTG GTACAACTTC AACACGTACA 5 640 

CTTGAAACAA TTCGACGCGA T CATGAT AAA TTTGTTGAAA CGAGTGGCTG GACTAATATA 5700 

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760 

30 

CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTC GTGAAAA TGTTCTGAAT 5820 

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTTGGCGA TGCAATGTTA 5 880 

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 5 94 0 

35 

AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 6 00 0 

AACKCCTATG TTTATGCCAG TTGGTACTAA AGCAAC CGTT AAAACAATGA GTCCAGAAGA 6 060 

40 GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TAT CATTTGT GGTTACAACC 6120 

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180 

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 6240 

45 AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6 3 00 

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 6 3 60 

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420 

50 GGCGAAAAGA TGTCTAGATG CACAC CAAAG ACCTGAAGAT CAAGCATTGT TCGGCATTAT 6480 

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 654 0 

55 
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AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT 66 60 

AGGATCTCCa GATG CGTTAA T CG AATG TAG TATTCGCGGC ATGGATATGT TTGATTGTGT 6 72 0 

5 CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 67 8 0 

TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 684 0 

ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 6 900 

w 

TATTCGTCTT ACTA CT ATT C ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 6 96 0 

ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 702 0 

1S ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 7080 

ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 714 0 

ACAAAAACTG TGCGAAACA 7159 

20 (2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3159 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

30 

TCCATTTATA GAAATTTCTA AAGCAGAnAA 

AATCATCAAA AGAACACAAG ACATTACTAG 
35 ATTTGATTTT GTAAAAGATA AAAAATATAA 

CTT CATTAGC TCTAGAaCAA CATATTCAGA 

GATATGGCCA TTTCAATATA ATATCAGTTT 
40 TAATTATCTT CCTAAAAATA rAATTGATTC 

TATCGGCGGA AACTTCCAAT CAGCGCCATC 

TAAAACAATT AGTTATAATC AAAAAAACTA 

45 

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC 
ATATGATCAA TACTTATTTG CACAAGACCC 
50 CCCAGATAAT CAACTACCTC CTTTAATTCA 

ATTGTCACAC GAAAGAGGTA AAGGTGATAA 
CATGGATGCT ACATATGCTT ACGTGACAAG 

55 



TAAGATAGAA GATATCGGCC 
CAAACGATTA GCTATAACTC 
CAAAGATGCC CTAGTTGTTA 
CTTAAAAAAA TATCCATATA 
GArAACGAAA GACTCTAATG 
AG CAG ATGTT AGTCAGAAAT 
AATCGGAGGC AGTGGCTCAT 
TGTTACTGAA GTAGAAAGTC 
ATTCGTTACA CCGAATGGTC 
AACTGGTCCA GCAGCACGAG 
AAGTGGCTTT AATCCATCAT 
AAGCGAGTTT GAAATcACTT 
ACATCGTTTA GCCGTTGATA 



AAGGTGCAGA 60 

AAAACATTCA 12 0 

A GATG CAAGG 18 0 

TTAAAAGAAT 24 0 

TTGATTTAAT 300 

TAGGCTATAA 36 0 

TCAACTACT C 420 

AGAACTCTAA 4 80 

AAGTATCTGC 54 0 

ACTATTTCGT 6 0 0 

TTATTACAAC 66 0 

ACGGCAGAAA 720 

GAAAACATGA 7 80 
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AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 900 

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 96 0 

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 102 0 

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 108 0 

TCAAGTTAAT AAA C AATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 114 0 

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAG CAAGACA TTCGTGCAAT CGGTTACCTT 12 00 

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 12 SO 

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 132 0 

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 13 80 

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 144 0 

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAATCC 150 0 

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG G AAG CGATAT 15 6 0 

AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 162 0 

TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTGATAT TAAAGATGCA 1680 

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 174 0 

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 180 0 

AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 1860 

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 192 0 

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980 

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 204 0 

AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 2100 

TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160 

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAAGC GAATTTGAAA TTACTTACGG 2220 

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA A CAGTTATTT 228 0 

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATAC GAGGT 234 0 

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 24 00 

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 24 60 

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCGA TGACAAAGTT ACTTTATACA 252 0 

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 
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ACTCAGGCTT TGTGnAACCT AATCCTAATG A CT ATGACTT TTCAAAATTA TATTGGGGAG 2 7 00 

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 2 7 60 

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2 820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2 8 80 

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2 94 0 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 30 00 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAG CTGGC AGACAAAGCA 30 SO 

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120 

ACTTCAATCC AGAATTTTTA AGC GTACTAT CACACAGAC 315 9 
(2) INFORMATION FOR SEQ ID NO: 3 04: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60 

ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA TTTGCATTAT 120 

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180 

ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 24 0 

ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 3 00 

AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 3 60 

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 4 20 

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 4 80 

CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 54 0 

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGGGCGT TTTGATGTAA 6 00 

TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 66 0 

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 72 0 

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATTAGCTA 7 80 

AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG GATTTATTAT 84 0 
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AAAAACCTAA TGcTAAGATC AAAGCAATCA AAGGTAATGC TGAACAAAGT AGATCTGCAT 9 60 

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 10 2 0 

AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 10 80 

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACCT 114 0 

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAG TT ACATT TGTTTTAGGA 1200 

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 12 60 

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 13 2 0 

ATAT TTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACGCTGA CAACCAATGG 13 8 0 

ACATTGGCAC CTGTAGTGGC TGCTATTATT GGTTTATCAT TAAATGTAGG TGCGTATGCT 144 0 

TCGGAAATTA TTCGTGG CGG TATTATTTCT AT AC CGAAAG GACAAACAGA AGCTGCaTAT 1500 

TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 1560 

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620 

GGATTTATTT TAGTGGCTGA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 16 8 0 

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATG TACT GGGTGGTATG CTTTATTATT 174 0 

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 18 00 

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 186 0 

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 192 0 

TACG TATGAT TAATGCATTA GAAATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 198 0 

CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 204 0 

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100 

TCACAGTTAA AAAGTTGAAA AAGGATGAGG CACGTGGGAA ATCACTTGAG TTACTTGAGA 216 0 

AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2220 

AACGTGTTGC TATTGTCAAG AG CACTAGCA ATGAACCCTA AAGTGATGTT GTTTGATGAA 22 8 0 

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 234 0 

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 24 00 

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 246 0 

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520 

TAATCAAATA GAACCACGTA TCATGTTTTA GTATGGCGAT GAAGCCATAT ACATGATGCG 2580 

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 264 0 
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TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG C CATAGTCTT CTGATGACTC 27 6 0 

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAG CGATCA ATATGATGCG CTTTAGCAAA 2 82 0 

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 288 0 

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2 94 0 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 30 6 0 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AG CTGTTAAT TGTTGTTGCT TTAATTGTAA 3120 

CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180 

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 324 0 

AGC CATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 336 0 

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AG C CAACGGA TTTGGCTCAT 34 20 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 34 8 0 

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTG CGT A AATTTTCAAT 354 0 

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 366 0 

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780 

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3 821 
(2) INFORMATION FOR SEQ ID NO: 3 05: 

~ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AA CTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 12 0 

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 18 0 

CTT CTTAAAT ATCCATGTGG AT cGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 24 0 
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TATTGTAATT 


GTGTAGCTGC 


ATAGTCAGCA 


TAGTTATTCA 


TTGAACGTGC 


AAATGAAGCA 


360 


TAGTCGTGTG 


TGTG TACG AT 


TTCAACATTG 


ATGAATCTAG 


GGTTACCGAC 


TGCACCGACA 


420 


CCCCAAGATA 


AGTAATCCGT 


TGGTGCTGTT 


TCGATTATAC 


GATCCCCATC 


AACAAATGCA 


430 


TGTACGAATG 


CGTTTTGATA 


GTTATTTTTC 


ATATAACTAA 


TTTCACCATT 


TATCGTCGAA 


540 


CGATCATTAG 


CTGTATCATG 


AACTACGATA 


CCTTCAGGAC 


GACCTACGCC 


GTTACGGTAT 


600 


GCGTATTTAG 


GGAAG TAAGA 


TGTATAATCT 


TCTTCAATTT 


TAGGTGCTTT 


TAAGTTATTT 


660 


TTACAAATGT 


AATCGTTAAT 


TGAAGAGTTT 


ACTTGTGGTT 


TATATTTTGG 


CAAACTCGTT 


720 


TTTGGTGTTG 


CAGCAACTGA 


TCTTGGTTGT 


GCTGAAG CGC 


TAAAAGTAGT 


TA CTTTAGGT 


780 


GTCGCTTCAG 


TTTTAGCTTT 


AGGTGCTGAT 


GTAGTTGCAG 


CTTTAGGTGC 


TGCGGTTTTA 


840 


TATTGCGTyT 


CAAGAGCTGC 


AGGTTTAGCA 


GCTGATTTAA 


TTAATTCTGG 


ATTAATTTGA 


900 


TTTTCTGAAT 


TATCATCTTC 


ATCAT CAACT 


AAACTATAAC 


CAGCATTTGT 


AACATTAGTG 


960 


TTAGTTTTAG 


GTGCTGTAGT 


GCTTGTTGA C 


TTTGCAACAG 


GCTGCGTATT 


ATTTGTAGTC 


1020 


GCTGATTGAT 


TAGCACGAGT 


GTCACCATTT 


ACTTGTG CAG 


TATCAACTTT 


TTG A CTTACT 


1080 


TGAG CATTGC 


CTGTTTTGTT 


ATTTGCTGTT 


TTTGGTTGGA 


CAATAGCAGG 


GTCTTGATAT 


1140 


ACTTGAGTGC CAGAAATGTT 


TTGCGTTGGA 


TTTTTTACCT 


CAGCTTTTGC 


TTGTTCAGTA 


1200 


GTTGCTTTAA 


CTTTATTACT 


ATCTAAAACG 


TTTTTATTAG 


TAGTTTGATC 


TTGTGTCGTC 


1260 


TCAGCTGCTT 


GAACTTGATG 


TGCAGTGACT 


GCTGAACCTA 


CAAGCGTTAA 


TGCAACCATT 


1320 


GAgGTAGTTT GTAATTGAAT 


TTTTTCGCCA 


TTCTATTTAT 


TACTCCyAAC ATTTATTAAT 


1380 


TAT t ACT AAC 


ATTATAGTAC 


CTG TnTTATA 


TACCTGTGCG 


TA 




1422 



(2) INFORMATION FOR SEQ ID NO: 306: 

-(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6076 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 60 
AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 12 0 

50 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 
GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240 

55 
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GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360 

TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT AT CACGG AG A CTTTGCTATT 4 20 

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAAT CAT 4 30 

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 54 0 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT S00 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 6 60 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCCTT ATTGTAATTT TCCAACTACT 72 0 

CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 73 0 

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 84 0 

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900 

TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 960 

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020 

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 108 0 

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 114 0 

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 120 0 

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGT CTTCT GGTAATCCAC 1260 

CAACATT GTG GTG TGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 13 2 0 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 13 8 0 

ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 144 0 

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 150 0 

CACCGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGCAAGCCA TGGTCTACAA 15 6 0 

AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620 

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 168 0 

CAATTTCGAT TTCGATAAAG TTTTCGATTG TCCATTGACC TCTACAATCA CAAACACGAC 174 0 

GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800 

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 186 0 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 192 0 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC A CGTT CATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 204 0 
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TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160 

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220 

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 22 80 

TTAACTCTTG TTCTTTTGCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 2 34 0 

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 2400 

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAAC CAG 24 60 

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 2520 

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2 58 0 

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 2640 

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 270 0 

CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 2760 

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTCG 2820 

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 2880 

AAATTGAACC TGGGCCAATA CCAACTTTAA CAATAT CTG C ACCCGCTTCA AATAAATCTT 2 94 0 

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3 000 

TAATATGTTT CACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCGATAACTA 3060 

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAG CACGAAT ATCAGTATCT TTTGAAATAC 3120 

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180 

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 3 240 

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 33 00 

CTGGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 336 0 

CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 3 4 20 

CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 34 80 

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTTTTACTT 3 54 0 

TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 36 00 

CTTGACGAGC CATAGCAATC G C CATTTT AG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 3660 

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 372 0 

GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTTTTG 378 0 

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTC AAATTAATTA GTTATATTAT 3 84 0 
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GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATG CAAATTG TTCACCTAAA 396 0 

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4 02 0 

£ 

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 40 8 0 

ATTAC CATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 414 0 

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 420 0 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC C TACATTTTG CGAATAGGCT 426 0 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4 320 

, 5 CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4 3 80 

ACACCTGTC G ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4440 

ACATCAAAGC TAAATC CAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4500 

20 TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4560 

CCAATTAATA TGGCAATACT CTTAATAAAT C CAGTTGTG A ATCTTTGTAA AAGAAGAATA 4620 

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

25 TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 4 74 0 

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4 800 

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4 860 

30 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4 920 

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4 980 

35 ACTGTAACTA AGTAAG CGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5 04 0 

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 5160 

to GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220 

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCG CTTG CTGTGCGATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 534 0 

45 ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 54 00 

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 54 60 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

50 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640 

55 
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ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 576 0 

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATG CAATCAA 58 20 

TGTAAAAGAA GTTT CAT CAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5 8 80 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5 94 0 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAA GC CAT AT TCTTTAGATT ATATGAGTTT 6 000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6 060 

ATCTTTTAAT AGTTTT 6 076 
(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

ACCGCCGATT GATAGCTTTA CTGCTG CGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60 

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120 

AGATCCAATA TCGTAAAAAC A CAA TG CATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180 

GATATCAAAT GAAGTTCCAA TTGAAC CATC TGGTAATCCC CCGATTGGTA TAATATCAAC 24 0 

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 3 00 

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AG CGTTTAAA ATATAAACCA 360 

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAG CTCTAC 42 0 

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480 

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 54 0 

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATG CCTG A ATTAGTGTAA 6 00 

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660 

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720 

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780 

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 84 0 

GTTTTTTAAA C GT T TCTT CT AAATTTAATA AACGTGCTTT CATATCATT C ACTTTCATTT 9 00 

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960 
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ACCTTGCATG TCTAAAGATG TCATATAATC AC CAACAAAC CATTTAGCAA CATTAACCGT 108 0 

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 114 0 

S 

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200 

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260 

TTCACCATGT AT AC CAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 132 0 

w 

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 1380 

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440 

/5 GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500 

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 156 0 

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620 

20 ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 1680 

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 174 0 

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800 

25 TACAATAACT GT ATT AG CAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG 18 60 

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920 

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTAT CAT A AAGAATTAAC 1980 

30 

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 204 0 

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100 

3£ AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 216 0 

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTG CTA CGA CAATAC C TGCTGGAACC 2220 

CCACCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280 

40 ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340 

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 24 00 

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 2460 

45 CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520 

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2580 

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TAT CAT CAAA TTTTGACATA 264 0 

50 

ATTGTAATTG AGCCAC CATA TAATGCTGTT TGTAAAC CTA CACCTATTAA TACCATACTC 2700 

GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760 

55 
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CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2 880 

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2 940 

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3 000 

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3 060 

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120 

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180 

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 324 0 

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3 3 00 

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3 3 60 

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 3 4 20 

TT AAA CAT AT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 34 80 

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 3 54 0 

TTAATGGAAT TGCC CACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3 600 

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3 660 

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3 720 

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3 780 

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3 84 0 

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACAC CTT 3 90 0 

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3 96 0 

GAATC CTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4 020 

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4 0 80 

TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 414 0 

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4 2 00 

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4 2 60 

TAAAACAACA TCAATATTAA AAACTTTTTC TAATATTTCC TGTGTTAATA CGTCTTCTGT 4 3 20 

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCG CAATA AGATGATCTG AGAAACGAAT 4 3 80 

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAG CTT 4 4 40 

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4 5 00 

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4 5 60 
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TGCCCAATCA ATCTCTTTCT TAtCCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 46 80 

AAAACGACCA TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4 74 0 

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4 8 00 

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAG CAAAGTA 4 8 60 

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4 920 

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4 980 

CGATTCATAA TTTC CCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 5 04 0 

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT G ATT AT CATG AACATCATTA 5100 

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 516 0 

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 52 20 

AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 52 8 0 

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 5 34 0 

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 54 0 0 

AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 5460 

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 5520 

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580 

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 564 0 

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700 

TCGC CAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 576 0 

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 582 0 

ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 58 80 

TACTTCTCTT ACTCAAAAAt TTATAAc CTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 594 0 

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 600 0 

CGCCTTCACC TAAGTTACCA GAAATAATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060 

AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 612 0 

ATATTTATTA GACTAT 6136 
(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



1149 



EP 0 786 519 A2 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

GATATCGTA3 CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60 

TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 120 

TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180 

TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 240 

GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 3 00 

ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 3 60 

TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 42 0 

GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 4 80 

AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 540 

ACAATGACAT TCCAATTGTT AATCGTT C CA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 6 00 

GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 66 0 

CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 72 0 

TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780 

ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 84 0 

CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900 

GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 96 0 

GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 102 0 

AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 10 80 

TCAAtCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140 

CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 120 0 

CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260 

TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 13 20 

TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380 

TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 144 0 

TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500 

TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560 

CAATTCAAAA TGATTTTAAA CT CAT AT CTA AGTTGTTACA CACTTATAAT GATAAATCTG 162 0 
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TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TT ATT ATT AC 174 0 

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800 

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 18 6 0 

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920 

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGCAAA TGAAGCGCGT ATTGCGTATG 1980 

AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA ACaTTAGACT 204 0 

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100 

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTG CACAT ACAATATTAC 2160 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220 

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 22 80 

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2340 

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATT AG CAT CA CAGAATTTAA 24 00 

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 2460 

TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520 

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60 

TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120 

TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180 

TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 24 0 

GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA T CAATTGTTC 3 00 

ATATTC TAAT TTTTCTTTAT ATG ATAAAC C ATTTTTCTTA CGTACAGTTG TAGAAGATTT 3 60 

AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420 

TTCAAAAGTT CCGATAATCT TTTCCATCTG AC CATCATGA ATAAACCAAT ATGACTGTGC 4 80 
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AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600 

CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 66 0 

ACCAGATA 66 8 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTT CATGTA TTTATGTTGT TTG AAA CATC 60 

GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120 

CAAATAAACA ATGAATTTTA GAAGGT A CAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180 

GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240 

TTTTTATTAA GCGTTAGGTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 300 

CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 360 

TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420 

TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 4 80 

ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 54 0 

GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AG T AATAAAT 600 

TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660 

AACQTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720 

ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 7 80 

TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 84 0 

AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900 

TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960 

CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020 

GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080 

GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 1140 

TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGACGTAT 1200 
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AAAAGCCTAT TCTATTAAAA CGTGGTTTAT CTGCTACAAT CGAAGAGTTT GTTTATGCAG 1320 

CTGAATACAT TGCTTCACAA GGTAATCAAA ACATTATTTT ATGTGAACGT GGAATCCGAA 1380 

5 CTTATGAAAA GGCGACACGT AACACTTTAG ATATTTCAGC AGTACCAATT TTAAAACAAG 14 4 0 

GTACACACTT ACCAGTCATG GTAGATGTTA CGCATAGTAC AGGTCGTAAA GATATCATGT 15 00 

TACCAACTGC GAArsAgCAT TAGCAGTTGG TGCTGATGGA GTTATGGCTG AGGTGCATCC 1560 

AGATCCATCT GTTGCACTTA GTGATGCGGG TCAACAAATG GATTTAGATG AATTCCAAGC 1620 

ATTTTATGAT GAATTAAAGC CTTTAGCTGA TTTATATAAC GCTAAAAAGT TAAAATAATA 16 80 

j5 TT C CAAGGAA ACTATAGACT ACTTAACTAA TATGTCATGT TGAAGTAGAA TATTATCTTT 174 0 

GAATCGACAA TTTTAAACTT ACAGCCATTC TAAGAGTATA TTACTTTTAG AGTGGCTATT 18 00 

ATTTTTTGTA TAGAAATAAA GGTATACTGC ACTTAACGAT TGTTATAATA CTTCGACACT 1860 

20 TGTTCAATTT CACAATTATT AAAGATTATG ACTGATAGCA GTAATTAAAA TTATAACTAT 1920 

GAATTATCTG TAAAATATAA TAGATTCACA CATTTGTTGC TGAAATGTGA ACATTTTTCA 1980 

ACAAATGCAA TTGATATTTG AAAAGGCTTT CTCAAAACAT TACAATTAAA AATGAAAAAA 2040 

25 GTTTATATAA AATTAAAATA TATCGTTCGT TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100 

TTTTCGCTAA ATTTTTCCAA ACAAAAATAT GTTACTGTAA ATTAAAATAT GGTAAACTAT 2160 

GAAAATGAAA TGAAAACATG TTATTATAAT GAATAAAACG TTTACAAGGA GGAAATTATG 2220 

30 

ACAGTTACTA TATATGATGT AGCAAGAGAA GCGCGTGTCT CTATGGC CAC AGTGTCGCGT 2280 

GTTGTTAATG GGAACCAAAA TGTTAAAGCA GAAACTAAAA ATAAAGTTAA CGAAGTCATT 234 0 

AAGCGTTTGA ATTATCGTCC AAATGCTGTT GCTAGAGGTT TAGCTAGTAA AAAGACAACA 2400 

35 

ACAGTAGGTG TGATCATTCC AGATATATCT AATATCTATT ATTCACAACT TGCTCGTGGA 24 60 

CTTGAAGATA TTGCAACAAT G T AT AAAT AT CACTCAATTA TTTCAAATTC AGATAACGAT 2520 

40 CCTGAAAAGG AAAAAGAAAT TTTTAATAAC TTATTAAGTA AA CAGGTTG A TGGTATTATT 258 0 

TTCCTTGGTG GTACAATTAC TGAAGAAATG AAAGAATTGA TAAATCAATC ATCTGTACCT 264 0 

GTAGTAGTAT CAGGAACAAA TGGTAAGGAT GCACATATAG CATCAGTTAA TATTGATTTT 2700 

45 ACTGAAGCTG CGAAAGAAAT TACGGGAGAA TTAATTGAAA AAGGCGCTAA ATCATTTGCT 2760 

TTAGTAGGTG GAGAACATTC TAAAAAAGCT CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2 8 20 

GTGTTAAATA AAAATGGCCT TCAATTAGGT GATACATTGA ATTGTTCTGG TGCTGAAAGT 28 80 

SO 

TATAAAGAAG GCGTAAAAGC TTTTGCCAAA ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 294 0 

TGTATCAGCG ACGAAGAAGC AATTGGTATT ATGCATAGTG CAATGGATGC TGGTATTAAA 3 000 

55 
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CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATG CGCTTA 3120 

TTAACAAAAT ATATGAACGA TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

5 ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 324 0 

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 3 300 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3360 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 34 2 0 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 34 80 

1S AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTC CAT GCACGTGGTA 3 54 0 

CGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3 600 

AAT CAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3660 

20 GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3 720 

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 3780 

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3 840 

25 GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3900 

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3 960 

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4 020 

30 

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4 0 80 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 414 0 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4 200 

35 

TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 4260 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4 320 

40 GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4 380 

AACGCATTTG ATTGAAGGGA TGTTGGT CAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCGATAT TCCAAAAATT 4 500 

45 AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4 560 

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAG CTGGATG ACTTGTTATT TCTGGATCAT 4620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4 680 

SO 

CAAATACATC TAACTTTGAA TTTTT TAAAT CCCAATGCCA GTAAT ATT CT GTAGTTATAA 4 740 

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4 8 00 

55 
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4 920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4 980 

CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 504 0 

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100 

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGT CGTGAT 5160 

TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTT CACAT 52 BO 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAA CATAA 534 0 

CCTTAAAGAT TATGAAGAAA CATATAATAC TTTTGATTGG AAAGACGTAG AACAAGCATT 54 00 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC G C CATGT AG A 546 0 

TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520 

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 55 8 0 

AGAAGTTGAC AAAGG TG ACA GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 564 0 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700 

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 5760 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 58 20 

AGATGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 58 8 0 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 594 0 

TTCTACTGGG cAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TG CACTATAT 6000 

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGTTGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120 

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180 

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 624 0 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 6300 

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360 

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 64 20 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 64 80 

TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 654 0 

GTATCGTATC TGGAAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6 600 
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TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT 672 0 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 67 8 0 

AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATGAA CCAACAGACG AATTAAAAGA 6 34 0 

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6 900 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6 96 0 

TTGGGAATTA AATTTAGATG CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 7 020 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 70 80 

TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 714 0 

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 720 0 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 732 0 

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 7380 

AAAATTTCTT AGCTACAACT TACATATTAT AAATG CAT AA ATTAAACAAA AAGGGGCGAA 744 0 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAACCAATTA 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7S60 

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 

TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 768 0 

CTGATGCATT CC AT GAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 774 0 

GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7800 

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTACAACT GCAAATAATG 786 0 

CATTGTCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920 

GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CATGTAAACG 7980 

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8 04 0 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160 

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 8280 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 8340 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATT CATGGA CATTAAAGCG CGTGAAGCAG 8400 
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GTGTAGCGAA AGATAATTTA 
ATTTAGAGCG TCATGTTAAT 
ATGCATTTAT ACATGATACC 
ATAACGTACG AATTGC CTTA 
TAGCAAATGA AGTATTAGAA 
T AG AATT AC C ATTAGAGCAA 
AAGTAACGTT TAGCAGTAAA 
ATAATTACCC AGTATGTATG 
TAGGTGCACC ATCAGGATTT 
GATTTATCGT AGCGTTGACA 
CAG CAT T AAA CATGGATGTT 
TGTCAATTGT TTAATAAAGA 
TATTTCAGCA GTTCGGTTTT 
ACAATGTTTG AAGGTGGTTA 
GAAAGCGATG CAATGAATAT 
AATTGGGATG CGCTACAATT 
ATAATCTTAT AGGAGTTGTT 
ATTCTATTAG AAAATCAACT 
TTACTTCTCA ACATCAAGCA 
AAAATCAAAA TAATAATGCA 
CTGctACGCA ACCAGCAAAC 
ATGCAATTAA AGATCCTGCA 
TCAATTTCCA GTTATTAGAT 
AAGATCCAGC AGATGTGTAT 
CTGCTTCAAC ATGGAAGAAG 
TTGTATCATA TAGTCCTGTA 
GCACACAAGA ATTGAAAATT 
ATGATTATAC TAAATTAGTA 
CAGATACAAA TGATGCAGTA 



AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520 

AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 858 0 

GATGCAGAAG TAGAATATGT AAAATCTTGG G CTAAAGAAA 864 0 

ACTGAAGTTT GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 87 0 0 

GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760 

AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 88 80 

GCGAAAACAC AATATTCATT CTCAGATGAT CAAACGTTGT 894 0 

GAAATTACAA TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 900 0 

GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

ACTGATGATG GTCATG CAAT TGGGTTATTC TAATAAATCA 912 0 

TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 924 0 

ATTAATGTAT ATGAAGATAA AAAGTGGGCT TGAAGAATAG 93 00 

T C CAT ATT AA AAAAAATTAA TAAAATAGGT TGCAATATTT 93 60 

AACACTAATA ATTGATATTG ATAATTATTA TCAATTAAAT 94 20 

AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 94 8 0 

CTAGGCGTTG CATCGGT CAT TGTCAGTACA CTATTTTTAA 9540 

CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 96 00 

ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660 

ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 97 80 

AAAAACAATG AAACG CAGTA CTATCACTTT TTCAGCATCA 9840 

TACACTAAAA AGAAAGCAGA AGTTGAATTA GA CAT CAAT A 9900 

TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

CCAGAAGACC ATGC CTATAT TCGATTCCCA GTTTCAGATG 10020 

GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 

TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 1014 0 

GTAACGAATG ATCAATCAAG TTCAGTCGCA AGTAATCAAA 10200 
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AGGCAACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAAC GAAT GCAGATCAAG 10320 

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 103 8 0 

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 10440 

ATGCAATTAA AAACC CGGCT ATCATCGATA A a GAACATAC AGCTGATAAT TGGCGACCAA 1050 0 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10 5S0 

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGGACCAAT AATTGAATTA GGTTTAAAGA 10620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 10680 

TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 10800 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 10 860 

AAGAAACATA CAATTTACAA AAATTATTAG CTCCGTATCA CAAAGCTAAA ACGTTAGAAA 10920 

GACAAGTTTA TGAATTAGAA AAATTACAAG AG AAATTG C C AGAAAAATAT AAGGCGGAAT 10980 

ATAAAAAGAA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAG CAGTGA 11040 

CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTT A CAA GAAGCGCATT 1110 0 

TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 11160 

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 11220 

GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 112 80 

AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 113 4 0 

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTCA ATATCATGTC AGAATTATAA 114 00 

ATCAGGATAT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 114 60 

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGACG 115 8 0 

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 116 4 0 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 117 00 

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760 

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 118 20 

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 118 8 0 

ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 1194 0 

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCACA CCATCGAAAG 12000 



1158 



EP 0 786 519 A2 



CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 1212 0 
AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 311: 



CAACATTAAT 


ATTGATATTA 


AATCTTCCTG 


GATAACGTGC 


TTGTTCGAGT 


GATAAGTATG 




CACG CACTTG 


ACTTAAC TCT 


TTATCTAAAG 


TAATCGTATG 


TTGCTTAGAG 


C CTTGTAAAT 




TCGCTCTGAA 


AAAATAACTC 


AATTCTAATA 


GTAACTCTCG 


TGCCTTTT CG 


CTATTTATTC 




TAACTAAAGC 


TGAGATCGTG 


TTAATTGAAT 


TGAAGAAAAA 


ATGTGGACTC 


ACTTGTGCCT 




GTAATGACTT 


AATCTCAGCA 


TCTTTCAATA 


ACTTACTTTG 


CGTTTCGG CT 


TCACCAAGTT 




CAATTTGGCT 


ACTAAAAATA 


TTTGCCAATC 


CTTCTGCAAG 


TTGACGTTCC 






AAT CATT AGG 




TACATCTTCA 










GTGGTATCAC 


GATAGCTGCT 


CTAAGCGGG C 










TAGTATGAAC 


TTCTTTCAAC 


TTTCCTGATT 


TCAATACATC 


TTTAGACAGA 


CTTGTTAATA 


540 


TTTCATTTGT 


TGGTATGTGA 


TGATCACTAC 


CTGCACCTAC 


ATGCGATAAG 


ATTTCATTTT 


600 


TGCTTGTAAT 


TGCTACGGCA 


GATACTTTCA 


TTAAATTTTT 


AAT AAT CATC 


GCAATTTGCT 


660 


GTGCCGATTC 


TCTATTCAAT 


CCTTCTTTAA 


AATACGGCAA 


TGTCTGGTTC 


ATCAATTGCA 


720 


GTACATCATG 


TGTTTGAACA 


GCCTTCATTT 


GCTCCTCTTG 


CTTTAATGTT 


GAAATGATAA 


780 


TAGACATAAA 


AATCGCCGTA 


CCAACGCTAT 


TAACAATAAT 


CATTGGTAGT 


GCAATTAATG 


840 


ATATGAGGTC 


AACCGCATAT 


GCTTTGTCGT 


GGGAAAATGT 


TAAAATGCTC 


AACATTTGAA 


900 


TCATTTCCAT 


AACAATTCCA 


ATCATGGCAC 


TTTTCGCAAT 


ACTCGGGTAA 


CGCTTGCGTC 


960 


TTTGAGCTTG 


TAAGCCAAAA 


TAACCAGCAA 


TTATACCAAT 


AAATATAGAT 


GAGATAAGAT 


1020 


AAACTTGTGC 


ATCCGCCCCA 


CCCATATACA 


CTCTGAAAAT 


ACCTGAAATA 


ACGCCAACAA 


1080 


ATAGACCTAC 


AAAAGGGCCA 


CCAACTAATC 


CTGCGACACC 


TATCGTTAAT 


ACACGTGTGT 


1140 


TAGCTAAAGA 


TACATCATCA 


TCTAAACGGA 


AGTACACACT 


TCCTGACAAA 


CTATGTTGAT 


1200 


GATCGATGAC 


GATACCAGTT 


AAATTAGACA 


TTAAGGCAAA 


CAAACTGAAA 


ATAATACATA 


1260 
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 
s (A) LENGTH: 7972 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

w 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 60 

15 TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 12 0 

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGT CATTA AGAGATGAAG 180 

CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 24 0 

20 CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCG TGTAAAGATA 300 

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 360 

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 4 20 

25 

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TAT CAATGGG GTGCCTATTG 4 80 

CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 54 0 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 600 

30 

TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA G CAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 72 0 

3S TTGTACTAAA TGGTGCTGGT G CAG CAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780 

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCG CAAT TTTTGAAGGA CGTT CATATG 84 0 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 90 0 

40 CTTTAGAAGA AGTCGTAAAA GACG CAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960 

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAAT AC CT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 

45 

GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTC CCTGGT ATTTTTAGAG 114 0 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

50 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAG CGGCA ATGGAATCTG 1320 

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 1380 

55 
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ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAGTTCAGTT TAAGGCTCTA AATGGTTAGA 1560 

5 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 16 80 

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATG TGTG 174 0 

10 

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 18 00 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 18 60 

,5 aAGTTATTTA GAAAAAATTG AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920 

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 19 80 

TAGAATGGGA AGTATGGGAT CGGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 2 04 0 

20 CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100 

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

25 TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 22 80 

TCGTCGAGTT ATTGAACAGA CAATAAACGA AAAATTGCCA GATGATTTCC AAACTGCAGA 234 0 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 24 00 

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 24 6 0 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2 52 0 

3S GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAACGAGA AACTAAAAAA 2 58 0 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2 64 0 

ACGACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 270 0 

40 AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2760 

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA AT ATTT AT CG AAATTTTGGT 2 82 0 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2 8 80 

45 

AATCGTCCTA TCTTTACATT TATAGATACA AAAGGTG CAT ATCCTGGTAA AGCTGCTGAA 2 940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3 0 00 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3 0 60 

so 

GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGT GCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 
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GGTGCACATA AAGATATTGA ACAGCAAGCT TTAGCTATTA AATCAGCGTT TGTTGCACAG 3 3 00 

TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360 

5 AGAAATATCG GTTCTTATAT AGAATAATCA ACTTGAG CAT TTTTATGTTA AATCGATACT 342 0 

GGGTTTTACC ATAAATTGAA GTACATTAAA ACAATAATTT AATATTTAGA TACTGAATTT 348 0 

TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 354 0 

10 

AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTTTGATGG AATGACATAT 3600 

CAATTTGTGG TAATTTAGAG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGT CATGAA 3660 

is GAAAATTGCA GTTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3 720 

AGTTGTTCGT ACAGCAATTT ACAATGAAAT TGAAGTTTAT GGTGTGTATC ATGGTT AC CA 378 0 

AGGATTGTTA AATGATGATA TTCATAAACT TGAATTAGGA TCAGTTGGGG ATACGATT CA 384 0 

20 GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTAC G 3900 

TAAAGTTGCA ATCGAAAACT TACGTAAAAG AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3 96 0 

TGACGGTAGT TATCGCGGTG CACAACGCAT CAGTGAGGAA TGTAAAGAAA TT CAAACT AT 4020 

25 CGGTATTCCT GGTACGATTG ACAATGATAT CAATGGTACT GATTTTACAA TTGGATTTGA 4 080 

CACAGCATTA AATACGATTA TTGGCTTAGT CGACAAAATT AGAGATACTG CGTCAAGTCA 414 0 

CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATGGGC 4200 

30 

TGGATTATCA GTTGGTGCTG AGACAATTGT AGTTCCAGAA GTGAAAA CAG ATATTAAAGA 4 2 60 

AATAG CTG AT AAAATTGAAC AAGGTATTAA ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4 3 20 

AGCAGAAGGT TGTATGACTG CGCAAGATTG TCAAAAAGAA TTATCACAAT ACATCAATGT 4 380 

35 

TGATAATAGA GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440 

TAGASTTTTA GCATCACGTT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4500 

40 AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCTTTTG ATGAAATTTT 4 560 

TGATGGTAAA GATCATAAAT TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 4 620 

ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA ATTGTATGTA CAATTGGACC 4 680 

45 AGCTTCAGAA TCAGAAGAAA TGATTGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACG 4 74 0 

ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AGAATTGATA CAATTCGTAA 4 800 

AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4 860 

50 

TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920 

TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCT CA GTAACATATG aAAACTTAAT 4980 

55 
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 

5 

AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 52 20 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5 2 80 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 5 340 

10 

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 54 00 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 54 60 

(S AGTTATTACA GCTACACAAA TGTT AG ATT C TATGCAACGT AACCCACGTG CTACACGTGC 5520 

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATG CAGTAA TGTTATCTGG 55B0 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AG CTGTT AAA ACAATGAGAA ATATTGCTGT 5 640 

20 ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700 

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 57 60 

TAAAG CAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5 820 

25 TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 58 80 

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000 

30 

TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

3S AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 

TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 624 0 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300 

40 TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360 

TGCTCAAGGT AAAATCTTTG AAGGATATGC AAA CGTACTA TAATTTATAA AAAAACGTCT 6420 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 64 8 0 

45 AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 654 0 

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600 

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 

50 

TTTTG AG C CA AGTACGCAAT TTAATATTAT AAGTTG CATA TAAAACAGGA TGGGACATAA 6720 

ATCCCTAAAA AAACAG CAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 67 80 

55 
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tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTACGTAT TAACATGTAA 6900 

TTTTACTTTT ACATACTTTA AAAAAATAAA ACACTTTGCC CAACTTGCAC ATAAATGTAA 6 960 

5 

AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7 020 

AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC ATGACCCAAC TTACACTACC 7080 

AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG CGATATGTTA ATGAAATTGT 714 0 

TGAAACGATA CCTGATAGCG AATTCGATGA ATTCAGA CAT CATCGTGGCG CAACATCCTA 7200 

TCATCCAAAA ATGATGTTAA AAATCATCTT ATATGCATAT ACTCAATCTG TATTTTCTGG 7260 

,5 TCGTAGAATA GArAAATTAC TTCATGACAG TATTCGAATG ATGTGGTTAG CTCAAGATCA 7320 

AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT CCTAATACTG ATGCGTTAAT 7380 

TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 7440 

20 TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7500 

GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAA CATTATATCG 7560 

TGACTTAGTT GAAGAAAAAA TAATACCAGA GATAAAAGAA GATGGAGATA GCGATTTAAC 7620 

25 

AATAGAAGAA ATAGATTTAA TTGGTAGTCA TTTAGATAAA GAAATCGAAG ATTTAAATCA 76 8 0 

TTCTATTGAG AACGAAGATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 774 0 

GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AATATGAAGA 7800 

30 

ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7860 

ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7 920 

35 TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT G AC CTATTTC AA 7972 

(2) INFORMATION FOR SEQ ID NO : 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3175 base pairs 
4 o (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 

ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 

50 

CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GT CAAGTTG A TTTATGCTCA 180 

CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 24 0 

55 
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AATTAAATTA TTGTTGTTTT AACCATTGTA CTATCGTCGG AATCATAAAT CCTGTTGGCC 3 60 

CTTTTGGACC ATTATATGAA GCTTTATTAG TCGTTGCTGG ACCTGCAATA TCGAAATGAA 4 20 

5 TGTGAGGTGT TTGACCACTA AAATGTGTTA CAAAACTTGC CGCAAATAGC GCTTTACCTT 4 80 

GTCCATTCGT ATGGTTAACT AAATCAGCGA TATCACTGTG TTTAATACTT GCACGTTCGG 54 0 

TTGCAGTAAT CGGTAATTCA AATACCATTT CATCGACTTC AGAACTTATT TGTAATATAT 600 

w 

CGTTTAATAT CACTTTACTA TTCGATTCAA ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 

CAATTGCTGC ACCCGTTAAT GTAGCAAAGT CCATAATCAC ACTAGGCTGA TATTGATTTG 720 

CATAAAACAC AGCATCTGCA AGGACTAATC TACCTTCAGC GTCTGTATTC ATTACTTCTA 78 0 

15 

CAGTTTCACC ACTTAATGCT GTAAATACAT CATCTGGcTT CATTGATGCT TCATTTATCA 84 0 

TATTTTCAGC ACACGCAAGC ACTCCGACAA TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 

20 CTTCAATGAT ACCAACGACA TTCGCAGCGC CACACATGTC AAACTTCATT GTAGCCATGC 96 0 

CATTCTTCGT TTTAATACTA TAACCACCAG AATCATACGT TATACCTTTA CCAACTAAGG 102 0 

CAATTGGTGC TTCATCTTTG TCTTTGCCAT TATATGTGAT GGTTACTAAt CTCGGTTTAT 10 8 0 

25 GCTTACTACC TTTACCTACT GCTTGTAAAA GTCCGAATCC TTCAGAAACT AAAGTGTCAT 114 0 

AATCTTTAAC ATCTACTTTG ACCTTTGTAT TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200 

ATGTTTGTGG TGtTAATACA TTCGGTGGcA TATTACTAAA GTCTCyTGcC AAATTAATGG 1260 

30 aTTGGcCAAT ACTGATACCC TCATGAATAA AATCTAATTC AATTAATGAT TCACTAATAA 1320 

GGkTTAAATT CGTCTTAAAC GGtGCCTTCT kACTTgrTTT ATAATGATCG aATyCATATG 1380 

kTGcACGCTC ACTTTGGAAT ACCGCATGCC ATTAATACAT CTGgATAACT GgATCcATAT 144 0 

35 

TTTGgAnATA AATGaATCCa TAAGTAAATA CGTATCTTCa ATGTGTTCTG ACTTTATGTA 150 0 

TTGGAAAAGA TGTCCCCaTA TTTTCAACAT ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560 

40 AAACCAACTG TAATTAAACG ATATGTTTGA TCTTGTACAT CAAATG CAGT TGTATAAATC 162 0 

TTCCCAACTT TACTACCAAT AATATGTTGA TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680 

TCGATATGAT TAAAACTAAT GCGCTCTAAC TGATTTAAAT GTTCTGGTAT ACCAATAATC 174 0 

45 AATGTATTTA TTTCGTTGCT TAGTGTGTTA TTTAATTTAA AATTCATAAT GTACCTCCTT 1800 

AGATTTGATA TGTTACTCAC TTCAAATTGT ACAACAATAA AGCCCTcAGT GACACTGAAG 1860 

GACTTTATAA ATGAATAATT TAATTTTATG TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920 

50 AATTTACCTT TTTTGAATGC TAAACCGATA CCACCGATTT TGAATACCGC ACGTGTATCA 1980 

ATCACTTTTT TCATGAATGC TGCTTTTTTA CCAGCGATAG GTTTACCAAA TACCATACCT 2040 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTG CATT 2160 

GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 2 2 20 

TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 2 2 80 

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 2 34 0 

GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 24 0 0 

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 24 60 

AAGTAGCTAA cTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2 520 

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2 5 80 

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640 

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 27 00 

TCGATATGAC GTGATAATTC ACGTGCT GTG ATAACATTTT CAATTTGGAA AGCATGATCT 27 60 

TTCATACCTT CGATGCCGAA TGTTT CACTA ACGAAAC CTA ATGCTACTAC TAAAATATCA 2 8 20 

AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACG GTCAAT TTTTGTTACT 2 8 80 

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2 94 0 

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 3000 

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3 060 

GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 312 0 

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 314: 

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 6 0 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120 

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180 

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGC CACAT 24 0 

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300 
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TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGT CAT CTTTTTAGTG 42 0 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 4 80 

5 GGTATTACAA CGTATTTCTC TGAAAAGAAA AAG TATAACA AAGATGTTGA AAAACGAGAG 54 0 

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 

ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 
' S (A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 315: 

TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60 

25 AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 12 0 

ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 18 0 

GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 24 0 

30 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300 

CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 36 0 

ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 

35 

GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480 

TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 54 0 

40 AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 60 0 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 66 0 

TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

45 CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 

TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 84 0 

TTGTATCATG ACTCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 

GCTTCAGTTT CTG CAACTTC AG CAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 

CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 102 0 

55 



1167 



EP 0 786 519 A2 



TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 114 0 

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 1200 

GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 1260 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 132 0 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 13 80 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 1440 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 15 60 

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 1680 

TTAGGGATAA ATGCTAATGA GAATAG CATT GGTGACTCAA ATGTAATACG TCCTTTATAT 174 0 

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 18 00 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACC CAA 18 6 0 

ACTAAGAAAC TAAG GAACGC GATACCGGCA GTTGC C CAT A CCATACTTTG ATGTCCGAAT 1920 

AAACGCTTAC GAGC GAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 19 80 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATT GG CAT A 204 0 

CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACG CTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 2160 

ACTGTAAACA TTGG CATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2 220 

ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 234 0 

CCAGCTGCTG GTGAACCACC AATAATAAAT G A CAGGTTGA ATAAAATCAT ACCTG CAAAG 24 00 

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 24 60 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2 520 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2580 

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 264 0 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 2 700 

TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAAC C AATAACTAAG 2 760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 2820 
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TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 2 94 0 

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 3 0 00 

5 TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT G CAT AG AAG A 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

10 CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 324 0 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTC T TT TT G ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AG TT AG AGTT ACGACCTCTG AACGTACCAG 33 60 

IS 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 34 80 

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3 540 

20 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3 6 00 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA G CAACG ATT A 3660 

?5 TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3 720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3780 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3 840 

30 CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3 900 

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTTTTTT GACCTCCTAA TACTACAAAT 3 960 

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4 020 

35 AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4080 

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 
(2) INFORMATION FOR SEQ ID NO: 316: 

to 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

ts 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

;o CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60 

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120 



SS 
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TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAATG TATCATTTGT 24 0 

TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300 

S CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCGATT TTCATTATTA GTACCAAATC 3 60 

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 4 20 

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA GAAATCATAT 480 

"> TGCTTGTTAA TGATAGTTTT GAAGTACCAC AT CAATGG CA ACATAACGCT TATAGTGAAA 54 0 

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 60 0 

CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 66 0 

15 

TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 720 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 78 0 

2o AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 84 0 

GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 900 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGTTGAACAA TTAGGAAGTC 960 

2$ AATTGAACTT GCCTGACTTT TTAGAAGCAG AACGTGAACA AATTGAAAGT AAGTTACACG 1020 

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTCTTGAG 108 0 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 114 0 

30 TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 12 0 0 

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 1260 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTTC TTCAACGATT 1320 

55 ATTAGTG CAc AGATC CTGTA AGAGTATTTG AAGACTTCGC GACATTAGAT TTGATTTCTC 13 80 

ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 1440 

GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 

TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA 1560 

AAATATT CCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

45 CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 1680 

GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 1740 

GTTTCGATgc TTCGCCTAAG TCTTT AC CAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

50 AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACATTT GAATACAGGG ATGTCATTTA 1860 

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GT CAGATT AT CGAGAAGCGC 1920 
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GTCATCAACG TTTTATGGCA CAGCTTGATT TTGG CGGTGT GCCATTTGAA AATGTTATGA 2 04 0 

AGAATATTGA GTTAATTGGC AACGACATTA TACCGGCGAT TAAAAAGCAT TTATCAAAAT 2100 

5 AGGAGGGGCG TCATCATGAA TATTGTATTA TTGTCAGGTT CCACAGTAGG TTCTAAAACG 2160 

AGAATTGCTA TGGATGATTT AAAAAATGAA CTAGAAGTCA TCAATGAGGG ACATCAAATA 2 220 

GAGTTGATGG ATTTACGAGA ACTTGAATTA GAATTTAGCG TTGGAAAGAA TTATCTAGAT 2280 

'° ACTACAGGAG ATGTATATAA ATTAACGACG TCGTTAATGC AGGCTGATGT GATTTTTATT 2 3 40 

GGTTTTCCAA TTTTTCAAGC TTCCATCCCT GGTG CTTTGA AAAATGTGTT TGATCTACTT 2400 

CCAGTCAATG CGTTT CGTGA CAAGGTAATA GGACTTGTAG CGACAGCAGG TTCTAGTAAA 24 60 

15 

CATTATTTAA TTCCTGAAAT GCATTTAAAA CCAATATTGA GTTACATGAA AG CACATACG 2 520 

ATGCAAACGT ATGTATTTAT TGAAGAGAAA GATTTTTCAA ATCAACAAAT TGTCAATGAT 2 580 

GATGTTGTAT TTCGGTTAAA AGCGTTGGCA CAATCCACAA TGCGAACTGC CAAAGTACAA 2640 

20 

CAACAAGTGT TTGAAGAAGA AAACAACCAA TACGACTTTT AAAGTATAAA AATAAGACGC 2 700 

TCGGCACACT AAATTTGTAA GTGTTTGAGC GTCTTTTCAT ATTAACTATA TAGCCAATGA 2 7 60 

2S ACGACGATAA AG GCAAGTGA TGACAAG CAT ATTGAGGTAA TAATGATTGT CATAAGCGGT 2 8 20 

TTAAGTGCGC GATTTTTAAG ATCTTTAAAT GCAACATTTA ACCCTAAAGC AACCATGGCC 2 8 80 

ATTAATAAGC AAATTGTTGA TACAGTATTT AAAATATTTA GCAATGCTGA CGGAATAGTT 2 94 0 

30 ACATATGTAT TCACTAAGGC CATAATGACA AATCCAATTA AAAAGTATGG AATGCTTATT 3 0 00 

CGACCCTTGC TAGATGATTC TGATGAA CGG AAACGCATAA TTAAAATAAG TACGATGGTT 3 0 60 

AATGGAATCA GTAAGAATAC TCTACCAAGT TTACCAAGAA GTGCAATTTT AAGTGCATCA 3120 

35 CTACCACCAA AGCCACCAGC TAAGACAACG TGTGCAATTT CATGAAGACT AACAC CAG AC 3180 

CAAGCGCCAT AAACATTTGT CGTCATTGAA AAGATAGCGT AGATAGCTGT ATATATAAGT 324 0 

GAAAATATCG TACCAATCAA TGCGATGATA CCGATACTAA TAGCTGTATC CTTTTCACGT 3 3 00 

GATTTGAATA TTGGAGCGAC TGCGGCAATA GCAGCAGCAC CACAAACGCC TGTGCCGACA 33 60 

CCTAGTAATA ATGCGATGTT TTTGTCACCA TGCAACAGTT TGTTGACAAA GAGCATCATT 3 4 20 

4s ACAATACTGA AAATAACGAC ACCTACATCG ATGGCTAATA GTTTACTAC C TTGAC CGATA 3480 

ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 354 0 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3 6 00 

SO ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3 660 

GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3 72 0 

SS 
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ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG TTGTGTGATA 3 84 0 

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTGGCG GGGCCCCAAC 3 900 

5 AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3 950 

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATG CAAGTT GGCGGGGCCC CAACATAGAG 4 0 20 

AATTTCGAAA AGAAATTCTA CAAGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4 0 80 

'° CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 414 0 

TGAGGATGTG ATGGTATGTT TGG AATG AAA GTGAATGAAC AAATAACATT AAAAATTTTA 4 2 00 

GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAAA TTCACTTAGG 426 0 

IS 

GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTGC ATTTATTAAA 43 20 

AGAGGACTTT TGCAATTTG C TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4 3 80 

2o ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TGCACAGAAA AACTTCATTA 444 0 

GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA TGACACAAGC AGTTGAGGCA 4 500 

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4 5S0 

25 AAT AATG AAA AAAGCCGGGC TATTCCTGAA AGG CTGGGAT TTACTAGAGA AGGTATGTTA 4 620 

CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4680 

TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AGTGATGACA TTTAAAATGG 4 74 0 

30 CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4 800 

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTTAAAA 4 86 0 

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATGCGAAAA TAATATTTAT 4 920 

35 AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4 980 

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA CGATAGTAAG 504 0 

AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100 

ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATTTTTAAA AGGTACAGAA 516 0 

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 522 0 

45 GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTGGAGA TTTAGATCCT 528 0 

AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 534 0 

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 54 0 0 

50 GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 54 6 0 

ATCACACCAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 552 0 
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GTTGAAGGCG CACAAAAAAT 


TTATGACTTA 


TTTAAACCTA TTTTAGAGAA 


AAAAGATAAA 


5640 




AAATTAAGTG ATGATATCCA 


AATGAACTTC 


GATAAAGTGA ATCAATTATT 


GGATAAATAT 


5700 


£ 


AAAGATAACA ACGGC GGTTA 


TGAGTCATTT 


GAAAAAGTAT CGAAGAAAGA 


CCGTAAAGCA 


57S0 




TTTGCGGATG CTGTTAATGC 


ATTAGGAGAG 


CCACTAAGTA AAATGGCTGT 


GATTACTGAA 


5820 




TGACAAATTA TGAACAAGTT 


AACGATAGTA 


CGCAATTTTC AAGACGTACA 


TTTTTGAAAA 


5880 


10 


TGTTAGGTAT TGGCGGTGCC 


GGTGTTGCAA 


TTGGCGCAAG TGGTGTTGGT 


AGCATGTGGT 


5940 




CTTTCAAATC AATGTTCAAT 


ACACCAGAAG 


ATCCGGAAAA AGATGCGTAT 


GAATTTTATG 


6000 




GTAAAGTGCA ACCAGGCATT 


ACCACACCCA 


CGCAAAAAAC ATGCAATTTC 


GTTGCGTTAG 


6060 


15 


ATTTGAAGTC AAAAGATAGA 


GATGCAATTA 


AGGCAATGTT TAAAAAGTGG 


ACGGTTATGG 


6120 




CTGAT CGTAT GATGGATGGT 


GATACAGTTG 


GCAAGCCGAG TAACAATCCT 


TTAATGCCAC 


6180 




CAGTAGATAC CGGTGAATCG 


ATAGGATTAG 


GTGCAAGCAA GTTAACGATT 


ACCTTTGGGA 


6240 




TTAGTAAGTC TTTGATGAAG 


AAAATTGGGT 


TATCTAGTAA AATTCCCGAT 


GCCTTTAAAG 


6300 




ATTTACCGCA TTTTCCGAAT 


GATCAGTTAA 


TAGACGATTA CAGCGATGGT 


GATATTATGA 


6360 


25 


TTCAAGCATG CTCAAATGAT 


TCGCAAGTAT 


CCTTTCATGC GGTTCATAAT 


TTAGTTCGTC 


6420 




CATTTCGAGA TATTGTTAAG 


GTACGTTGGG 


CGCAATCTGG TTTTATCTCT 


GCTAAAGGTA 


6480 




AGGAAACACC TAGAAATTTA 


ATGGCATTTA 


AAGATGGAAC AATTAATCCT 


AGGAAGAATA 


6540 


30 


ATCAACTTAA AGATTATGTG 


TTTATTGATG 


ACGGATGGGC GAAACATGGA 


ACTTATTGCG 


6600 




TTGTCAGACG TATTCAAATA 


CACATTGAAA 


CGTGGGATCG TACTGCGCTG 


GAAGAACAAG 


6660 




AGGCTACATT TGGTCGGAAA 


CGACATAGTG 


GTGCACCGTT AACAGGTGGG 


AAAGAGTTTG 


6720 


35 


ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGG CGAGTA TATTATTGAT AAAGATGCCC 


6780 




ATACGAGGCT AGCGAAAGAA 


GCAAATACGT 


CAATTTTACG TAGAGCCTTT 


AATTATGTGG 


€840 


40 


ATGGTACGGA TGACCGCACA 


GGTAACTTCG 


AAACAGGCTT ACTTTTTATT 


GCTTTTCAAA 


6900 


AAGCGACAAA ACAATTTATC GATATACAAA 


ATAATTTAGG TAGTAATGAT 


AAATTAAATG 


6960 




AATATATTAC ACATAGAGGT 


TCTGCTTCAT 


TTTTAGTATT ACCAGGTGTT 


AGT AAGGG AG 


7020 


45 


GATACCTTGG TGAAACATTA 


TTTGACTAAA 


TTT GTAGCAA TGCTAATAAC 


TGCTGCTATG 


7080 




GTGTGTAGCT TTGGGTTACT 


GAAAAGTCAG 


GCAGCAGAAC AACAAAGTAT 


TAGTGATG T A 


7140 




TATAGTGTGA TAACGGATGC 


GAAATCTGCA 


CTTTCTAATA ATT CGATATC 


GAATGACAAT 


7200 


50 


AAGCAGAAAG CAATTGAGCA 


AGTGGTAAGT 


GCAGTTAAGA AATTATCGCT 


TGAAGATAAT 


7260 




AGTGAAAGTA ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 


7320 



55 
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GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 744 0 

GCTGCGATGA CAAAAGCGAT TAAAGATAAA AATAAAGCGG AATTAGAATC TTTGAACAAT 75 00 

AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 7 56 0 

TATGGACAAA TTGAAGTCGC ATT AT TACAA CTTAGAATTG CAATTCATAA GTCACCATTA 7620 

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7 6 80 

AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATG CGTTA 774 0 

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGT CGG ATGCTGcaTg TGCGCTTACA 7800 

CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 78 60 

TATACGAAAA TTGAAGATAA AATAC CATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7 92 0 

GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7980 

AGTTATAGCT TCGT CGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 804 0 

ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 

GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 822 0 

GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 8 28 0 

TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 834 0 

TTAGCGACGA TTGGTTTAAT ATCTGTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 84 00 

ATGGG GATG A TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 84 60 

GTTATTTTAA T CATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 8 52 0 

TATATATTTA GAGTGTTGTC GATCTTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 8 580 

AGTATTCAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 864 0 

ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 

ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 876 0 

GTTTTTTTTG TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA ATCACAGCAT 8820 

TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 8880 

ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8 940 

GGTGGTGCAA TGATACTTGC TAATACAAAG CAAGCGAAAT AAATATATTT ACG rTAATGT 9000 

TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 9060 

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120 
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GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 924 0 

CTATATTTAT AAATAAATTG ACGTT CATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 93 0 0 

AATTGATAAA 9 310 
(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS .- 

(A) LENGTH: 34 58 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 6 0 

CCTTG CTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 12 0 

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 24 0 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATAC CAAT 3 00 

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 360 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 42 0 

CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 480 

GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 54 0 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 66 0 

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT T CAT AAACT A 840 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 96 0 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 102 0 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 108 0 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 114 0 

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGAT CGT GTATCAACGA AAGATTAATA 1200 
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AACGTACAGA GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTGTCT 1320 

ATCAACTTGA TCGGTTAGGT AGAAC GACAA AACTATTaAT TGAATTaTCA CAATGATTCG 1380 

5 ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAA CGAAA GACACAATGG 14 40 

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 1500 

GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 15S0 

10 

CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG 16 20 

GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 16 80 

AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 174 0 

J5 

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 18 00 

TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATGCTTATGG TTGTATTAAC 18 60 

20 AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 1920 

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 198 0 

GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 2 04 0 

25 TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA 2100 

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAGCAGAAA ATACAATTAA 2160 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 22 2 0 

30 AATTCACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 22 80 

ACAGTTg GAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 234 0 

AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 24 00 

35 

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 24 6 0 

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2 52 0 

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTGAACGAT ACCCATTAAA 25 8 0 

40 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC T CTTATG AT A 26 4 0 

TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 2700 

4S GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 276 0 

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAG CTA AGTGTGTTTT TACTTTATCG 28 2 0 

TTAG TTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 28 80 

so ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 294 0 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTGAT 3 00 0 
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ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 



3120 



AAATCGGTAA ATTTTAC TTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 



3180 



5 



ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 



3240 



ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 



3300 



TGTGCATACT TAGCAAGCTG G AAATT AG CA AGTAAATATA CAACTGCAAT TACAGCAAAT 



3360 



GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 



3420 



TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 



3458 



(2) INFORMATION FOR SEQ ID NO : 318: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
20 (D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION : SEQ ID NO: 318: 

25 GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 6 0 

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120 

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180 

50 ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 24 0 

ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGAcG 3 00 

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 36 0 

35 

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 42 0 

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 4 80 

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 540 

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600 

TGTAATCTTG CTTAGAAAAT AATTTAACTT CCTCTTCTGT TTyCTTACGT CCAGGTAATG 66 0 

4S GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 695 
(2) INFORMATION FOR SEQ ID NO: 319: 



li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 60 

AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120 

AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTG CTA 180 

AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCG CCG 24 0 

AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 3 00 

TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 360 

TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 4 20 

GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 480 

GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 54 0 

GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 600 

ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 6 60 

GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 720 

AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 780 

TCAAACCAAT ATGGTCAATT AGCAATACAA GG AC CAAAAG CtAGAGATTT 84 0 

TTAGTTGATG AAGAkGTAAC TGAAA 875 
30 (2) INFORMATION FOR SEQ ID NO : 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

^ (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60 

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 120 

45 ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180 

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 24 0 

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG GATTGCAGGT 3 00 

50 GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 3 60 

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 4 20 



AATATTGaTa 
5 AAGACAATCA 
GCATTCAAGA 
AATTTAATAA 

10 

AAGtGCAAGT 
GAATTGGTGA 
TATTTCAATG 

IS 

TGCAAAAATT 
GGAGCATAAT 

20 TGAAGTAACA 
TAATTTAACT 
TGATGATTTA 

25 TAATACTGAA 
ACAAAATGTA 
AATTAATCAA 
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AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 540 

GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 6 00 

5 CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 78 0 

10 

AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 84 0 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900 

is AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 96 0 

CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 1080 

20 AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 1140 

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA G CGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 1260 

25 GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320 

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 13 80 

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 1440 

30 CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTA CCAG AGAGTGACAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 1620 

35 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGa TAAAAA CGATAATGCC 1740 

4Q ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TT CAT ACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AG CATTAGAT 1920 

45 CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGCG 1980 

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 204 0 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

50 ACTTTAGACT ATGTGATTT C AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160 

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220 
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TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACGAAGAT 2 34 0 

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 24 0 0 

ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 24 6 0 

AGCAT CAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTT AG TG A TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 258 0 

TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 264 0 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 276 0 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 2880 

ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAACCTAAAG 294 0 

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3 00 0 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCG TGCTGAA GATAGAAAAG 3 06 0 

AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120 

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC G CAATGG AAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3 24 0 

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3 30 0 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 33 6 0 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 34 2 0 

CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA 34 8 0 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGGTGTGT 354 0 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 36 60 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 3720 

GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 37 8 0 

AAGTGATGGG GAAAGATACT A CATT AG AAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3 84 0 

GCGTTGAAGT TAAAGAT CAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 39 00 

AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3 960 

GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4 02 0 



1180 



EP 0 786 519 A2 



TGACTAAAGG TAAAGAAGTA GAAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 414 0 

ATGGTATTCC ATGTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT GTAATTGAAA 4 2 00 

5 GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4260 

TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4 3 20 

TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 43 80 

GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 444 0 

GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGC CATGAT 4 500 

r5 GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4 560 

CTAGCTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 4 620 

TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4 6 80 

20 CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACA AACATCGATT 4 74 0 

GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4 8 00 

TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4 860 

25 ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4 920 

GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT TACACAAAGT 4 980 

GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 5040 

30 AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100 

CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCGTGTTAC G TTAGGTTATC 5160 

CTTTAGTTCG AGGCGCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220 

35 

AAGTAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 5280 

AAAAATTAAA ATTACCAATG AATTATGTTC GTTCATCAAA TAAGAGTCAT GGTAAGCAAA 534 0 

40 ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 5400 

CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 5460 

TAGGTGTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 5520 

45 TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAniTG 5580 

AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 564 0 

AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 57 00 

50 TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 5760 

TAGTCACGAT GTCGACGAAA GTTCAACGAA GCAAAAACAT ATAAAAAATA AAACAACTAT 5820 
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TTAAAAACAA AAAAGCT 

(2) INFORMATION FOR SEQ ID NO: 321; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7965 base pai: 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 6 0 

TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 120 

GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 24 0 

GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 3 00 

TGAGG CTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 360 

TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATGTGCCTTA 42 0 

ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 4 80 

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 54 0 

AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 600 

kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 66 0 

GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 720 

CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 780 

TTATCGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTT CATT 84 0 

TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 

TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 960 

ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 1020 

CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080 

TTTTG ACT CG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 1140 

CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAAT CATGT 1200 

AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260 

AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 132 0 



1182 



EP 0 786 519 A2 



TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTAGGAAAT 14 4 0 

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 150 0 

CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 156 0 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 16 2 0 

ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 16 80 

TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 174 0 

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 

TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 18 60 

TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 192 0 

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 198 0 

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 204 0 

TTAATGATAA ACCTGCGCGA TTAATGGATA AAGTGCATAC TGG CGACGAT GTATTTGTTC 210 0 

CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 2160 

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC C CTAATG ATT 2220 

TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 22 80 

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAAT C CTT 234 0 

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 24 00 

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 2460 

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 252 0 

TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2 580 

GACGTACTCA TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCCTGTT ATTGGTGATC 2640 

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700 

TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760 

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2 820 

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2 880 

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2 940 

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 3000 

TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 3060 

TAATAAAACG GACT CCATGA AAAGTTTTTC CTTTTCATGG CTCTATATCA AATCAGACTT 3120 
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CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 324 0 

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT T CTTTT AAAG 3 30 0 

5 

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 33 6 0 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 34 2 0 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 34 8 0 

10 

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 354 0 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 360 0 

, 5 TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660 

TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3720 

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 37 80 

20 TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATA CAAATG GATAACCCGT 384 0 

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATAC CAGT 3900 

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3 960 

25 TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4 02 0 

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 4 080 

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 414 0 

30 

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4200 

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATACA TATTAGCAAC 4260 

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4 320 

35 

AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4380 

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440 

4Q CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500 

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4 560 

TC CAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620 

45 TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT TATGATTAAA ACGTTTTCAT 4 680 

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4 74 0 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTAC GATA 4 8 00 

50 CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4860 

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4 920 

55 
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TACAATACCT GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 504 0 

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 510 0 

5 

AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 516 0 

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 522 0 

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 52 80 

TGATAGATGA CTTCTTTCAA TTATGTCTGG AG TAATT AAT TATCAATTCC GTTTAAATGG 534 0 

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 5400 

JS TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 54 60 

CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 552 0 

CCCAATTTCT ACAGACATTG CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 5580 

20 AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 564 0 

AATTTCTACA GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 5700 

TCAG CTTAAA ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 5750 

25 TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820 

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA C CTAACTTCA TATTCAAGAT 58 80 

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 5 94 0 

30 TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6 000 

TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 60 6 0 

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGT CTTTAGA CATAATCAAT 6120 

35 

ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAG CTTTAAC ATCTTCAGGT 6180 

AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 6 24 0 

4Q GCTGGTTCGA AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 6 3 00 

TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 63 60 

ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 64 20 

45 TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AGCAATTGGA 64 80 

TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 6 54 0 

AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATT CATCT 66 00 

50 AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6660 

AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 6720 

ss 
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GATATTAATA 


ATTATTATA C 


CCTAACTTTC 


AATATATCAA 


ACCATTTAAC 


TTTAACATGC 




TTATACT CT A 


AATATAGCAC 


TTAAGCATCA 


TTTTT ATAAT 


GAAAATGAGT 


AAATTTTAAT 




TCAATCCCGG 


rAAATCTTGT 


TGACGTAACG 


CTTCATAAAT 


TAACAACGCA 


GCAGTATTTG 




ATAAATTTAA 


TGAACGAATA 


TGTTCACTCA 


TAGGAATTCT 


TAACGCTGTG 


TCTTGATATT 








AATCCTGTCG 




AAAAATGAAG 


TAAAAATCTT 




TGTCATGATT 


TGAAAAATCA 


AAATCACTAT 




ACCAAATTTT 


GTTAATAAGT 




AATACTCGCC 




TCAAAAAATG 


CTTCAATACT 


ATCATGATAC 


GTAATATTCA 




CAAATTCCCA 


ATAAT CT AAA 






ATCATCAGTT 


CTAAATCCAA 










TACCTGCACA 


CGTACGgCAA 


TGTTACCAGT 














TATTCTCTCC 




TTGTGTCTAA 






GAACTTCTGC 




T CATAATT AG 




CATTGATAAA 


ATCTCTTGCT 


TCTTCCCCAA 


GAATTTGACC 


AATGGCCCAA 


TAAGCAGTTG 




CTC GAATCAA 


CGGTCTTTCA 


TCTGTTGTTG 




CAATT CTGGA 


ATTGCATCCA 




CTTCATTAAA 


ATGCGCCAAT 


G C TAAAATAG 






TTACCACGCC 




AAGCACCTGC 


AAGgTGACCA 






ATTAGACATA 


CGTAGtAAAG 




GTACTAATCT 


TGGCTTTAAA 


ATTTCTGGTT 


CCAAAATGAT 


GTCATCTTGT 


TCGGTATTAA 


7740 


TACCTCTATT 


TTTCGGACAA 


ACTTGTTGAC 


ACGTATCGCA 


AC C ATATAAT 


CTATTCCCAA 


7800 


TTTTATAACG 


ATATTGGTCA 


GGCATATAGC 


CTTTTGTTTG 


CGTTAAAAAA 


CTAATGCATT 


7860 


TCTGACTATT 


TAATTGGCCA 


TTTCCAACTA 


ATGCACTTGT 


TGGACAACGA 


TCAACACAAA 


7920 


TTG TACAATG 


CACCACAGCT 


ATCTAATAAT 


GGATCATCAG 


GTTCC 




7965 



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1302 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 
GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 6 0 

50 GG CAATT AT A GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 12 0 

ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGC TCTG AAATTGAGTG ATGTTTCATT 180 

ss 
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TGTAGTTGTT GAAAATATTT ATCGACGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 300 

AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360 

5 TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 4 20 

ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 4 80 

CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 54 0 

10 

TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 600 

TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTG CAACTA TTGTATTTGG 66 0 

AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720 

T5 

TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780 

AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 84 0 

2Q TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGG CAATC ATGGTTGAAT ATGATAATGA 900 

CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 960 

TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020 

25 TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 10 80 

AATGAAACAG GTTAAAGGAC TAGCCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 114 0 

GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200 

30 TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 126 0 

TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GC 1302 
(2) INFORMATION FOR SEQ ID NO: 323: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 
ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 6 0 

45 

TTAAACCACC TT CAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 12 0 

TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180 
so TAGTCCAGTC ATTTGTGAAG TATGCTG CTT CATTTTCATA AGTATCAATC GCAGCACCTA 24 0 

ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 3 00 

£5 
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GAT AG CTTTC TTTGTTCGCT GGAACATGTA AAGAAATAAT ATCGGCATCT TTAATAGCTT 4 20 

CTTTAACACT ATCTTTATAA GTTAAAAAGT CTAAATCTTT ATTAGGATAG GCGTCATAAG 4 80 

CTGTAATTGT AG CAC CAAAT CCTGCATATA TtTTAGCTGT AGCAGCACCG ATACGACCCG 54 0 

TACCAATAAT TGCAACAGTC ATATTTTTAA CTGGTTTAGA CATGATTTCT GCTTGCCAAG 6 00 

TAAAATCATG TGCTTGTACA CGGCGTTCAA TATCTGGGAA GCGACGCACT AATTGTAGGG 66 0 

CGATAGArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 72 0 

CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCAG CA GTACGTTGTG 78 0 

CAATTTGTTT AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 84 0 

ATTGCATTGT AGTTACGCCA TCGTAATCTT TTAATTGAtC GACTGTAGCA CTTGATAATA 90 0 

GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 96 0 

TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCGTCAT TAT 1003 



(2) INFORMATION FOR SEQ ID NO: 3 24: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 5030 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
GGCTTTTATA GTGTTTACGA TTATTATGAT TACTTATATT CACGCTGTTA AAAATTATCC 6 0 

35 AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 120 

TATCACAGGT GCATTATCTA TTATGACAAA TGTTAACCTG ATAATCGCAC TTTTCCATGC 180 
ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 240 
AGTAAGAAGT GACArGCAAT AACaAAAAAG cmAACCGTAA TTTTAATGG C ACGCCCATTA 3 00 

AAATTACGGT nTTTTATATC AATATTTAAA AtTAAACCtA AGCCATGTAA AAACGAGATT 3 60 

45 ACACGTCAAT TGTTGTGTAA TCTCGTTTTA TnTTAATCAT TTTAGTCAGT TGCTTTTTCA 420 

ATTTCGATTA ATAAATCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 480 
ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AGTAATTAGC 54 0 

so AACGG CTGAT TAGCTTTCAC AGTTTCACCT ACACTAACCT TGACTTCAGT TACTGAACCT 600 

GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 66 0 

TTTGTATGCA CATTTTCATC TTTAATGTAA AT ACGTCTC G CTTGACCATT CAT CGCATAG 720 

55 
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CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 84 0 

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 96 0 

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAAT CTTT ATTAAAACC7V 1080 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 114 0 

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200 

ATATCACCAA CTACTTTAGA CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 126 0 

ACTC TGCG AT ACAT AT CTTT GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 13 20 

TGTTGACTTA AATTCGAATA CTGTCCACCA GG CATTT CAT GTTGATAAAT TTCAGTATTC 1380 

GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 14 4 0 

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TCAATGATAT CGACACCAGC ATCTATTGCT TG TTTGTATG TTAATAAACC ATTACCACTT 1620 

GTAT CATG AG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 174 0 

CCTTCACGTT CTAACTCTTT AGCTAGTTTG ACATAATACT CTAAAGTATA AATGTTT GAT 1800 

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCC GCT 1860 

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 1920 

CTAAAGACAT CTATGCCTG C TTTAGCACTT TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GGATAGTTTT TATAACCAAC TGCGTTTGAA GCACGTAACA ACATTTGGAA TAATACATTT 2040 

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AACCAT CTTT AAATACGTCC 216 0 

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2 220 

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 22 80 

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 2 34 0 

ATTTTACTTG AAGATACAGT TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 24 00 

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 24 6 0 

CTATCTAGAG ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2 520 
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TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 2 64 0 

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 2 700 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2 75 0 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 2 820 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 28 8 0 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2 940 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3 000 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 3 06 0 

AATTGAATTG CAGCATCACA AATA CGTTGA CGTAATG TTG ATGATAATCC AACTGATGGT 3120 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180 

ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 324 0 

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTT CTGA TTTTGCTCTA 33 00 

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 3 3 60 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTG CAAATTC TTTTGCTAAT 34 2 0 

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 34 80 

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 3 54 0 

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 36 00 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3 6 60 

TTTAAATAAC TTTCAGCAGG AC CT AAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3 72 0 

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3 780 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3 84 0 

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3 900 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 3 960 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4 020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4 080 

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAG CGAT AACTTTATCT 414 0 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4200 

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4 260 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4 320 
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TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 444 0 

TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAG AAG T AAA 4 50 0 

5 CCCATAGCAA TACTTAAACT AATCATTG AT GATCCACCAA AGCTGATAAA TGGCAATGGC 4 5G0 

ACACCAGTTA ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4620 

AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4 68 0 

AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 4 74 0 

AAACCTAATT CTTCG CAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4 800 

^ CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4 86 0 

AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4 920 

GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4 980 

20 ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030 

(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 
2S (A) LENGTH: 1389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 



CTTGTTAATC 


CGAAAATTAT 


TAGTCAATCA 


AATG AAA CAA TAACAGACTT 


nGAAGGTTCA 


60 


ATTACATTGC 


CAGATGTTTA 


CGGCGAAGTG 


ACAAGAAGTA AAATGATAGT 


TGTCGAAAGT 


120 


TATGACGTCA 


ATGGGAACAA 


AGTTG aACTA 


ACTGCACATG aAGATGTAGC 


AAGAATGATT 


180 


TTGCATATTA 


TAGATCAAAT 


GAACGGTaTC 


CCTTTTACAG AACGTGCGGA 


CCGTATTTTA 


240 


ACAGATAAAG 


AAGTGGAGGC 


ATATTTTATA 


AATGACTAAA ATAATATTTA TGGGTACACC 


300 


AGACTTTTCA 


ACAACTGTTT 


TAGAAATGCT 


TATTGCAGAA CATGATGTCA 


TTGCAGTCGT 


360 


AACGCAACCA 


GATCGACCTG 


TTGGACGTAA 


ACGTGTTATG ACACCACCAC 


CAGTTAAAAA 


420 


AGTTG C AATG 


AAATATGATT 


TACCTGTATA 


TCAACCTGAA AAATTAAGTG 


GATCAGAAGA 


480 


ATTAGAACAA 


TTG CTTCAAT 


TAGATGTAGA 


TTTAATTGTA ACTGCTGCTT 


TTGGACAATT 


540 


ATTACCTGAA 


TCATT GTTGG 


CATTAC CAAA 


TCTTGGGGCA ATT AATG TAC 


ATGCATCATT 


600 


GTTACCGAAG 


TATAGAGGTG 


GTGCACCAAT 


TCATCAGGCA ATTATC GATG 


GTGAACAAGA 


660 


AACCGGCATA 


ACAATTATGT 


ATATGGTTAA 


AAAATTAGAT GCGGGTAATA 


TTATTTCGCA 


720 
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ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 84 0 

CGTAC CTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG 96 0 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080 

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 1140 

GAGAATG TT A GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 120 0 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 12 60 

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATG CAATG 132 0 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 138 0 

TTTTATTTA 13 8 9 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2746 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60 

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 12 0 

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 24 0 

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 3 00 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CAT CGCTAAA GATTTAGACC 3 60 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

aAGAAGATGG cmAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 4 80 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 54 0 

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 6 00 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATT AAATG Ca ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATG C 720 
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AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 84 0 

CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 90 0 

GTGTATCTCA AAA C C CTAAT G AC C CACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 96 0 

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 102 0 

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 10 BO 

GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 114 0 

TATCAACTAC AATTTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 120 0 

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA AT CAAT TTGT 126 0 

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 132 0 

CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT T CTTGATTAT 13 8 0 

GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 14 4 0 

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 150 0 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 156 0 

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 162 0 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 168 0 

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 174 0 

CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 180 0 

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 186 0 

CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTG TTT CAT AATCTCTTTA CACCATTGCA 192 0 

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATCTAAAG 1980 

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2 04 0 

CGATACGTGC TAAATAT CCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100 

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 216 0 

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 2220 

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 2280 

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 2 34 0 

ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 24 00 

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 24 6 0 

TCCCTTCGTA AAA TT CAT AT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 2 52 0 
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TCCACATGAA TTGTATGACC TGTAG CAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 2 64 0 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2 700 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 274 6 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 900 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 
TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 60 

AGTTTGAATT TTAAAAGCAT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA GAAATTCATT 120 

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 180 

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 24 0 

TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 3 00 

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT GATGGTGTTG AACACAAAGG 3 60 

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 420 

ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 4 80 

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 54 0 

TAAATTAGAA TTATGGGCTA TGC CAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 600 

ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 6 60 

ACCTGATTCA GACTATTTAT CTGT CTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 720 

TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG AC CG CTTTAT 780 

AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 84 0 

TCCATTAGAA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900 
(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 64 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60 

CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG cTTTATGcCC tTCtATTAAA CtCGGaACt A 120 

AAAATGaTGa CtTTTCAACa CCGAACaCTA AAACaTTGtC tTTTTGATGG CCaTTAGACA 180 

CAATTTCCCC TGTTTGCTTC AAAGTAGCTT GCTTCTTGTA TTTATTTTCA ATATCTTTCT 24 0 

TGTTAAAAAC AGATTGTTGC ACAGTTTGAT TGGCATCTTT ATTTAGAACA ATGGCATCTG 300 

CTTGCCACTT ATCAATGCCT TCTTTATTCA TATTGATAAG ACCATTCGCC AATCCAGATA 360 

ATAAAAATAG CAAGTAACTA ATCATCGTTA ACACAC CAAT AATTAGTCCA AACTTCAATT 42 0 

TGTTGCGCCG TATTTCATTC CAAGCTAAAA ACATGCATTT CTCTCCCTAC TACTATGATT 4 80 

TAAACATTGT TTATATTCTT AGATGCACGT ACGTCGTGTT GCGCTCTGTA ATGTTATACA 54 0 

TACACTTATC CTTCATTATA CCCGAACTTT TTATATTAAA AG CAAATTTA TGGAAAATGC 600 

AATTAATTGT CTATTATTTT TGTACGGTAC ATTTAAAATT AAGGATCAAT TTAAAAACGC 660 

CTACATATAC CTTTAAGTAC ATGTAGACGT CCAATTCATA TATTATTTAA CTTCGCCTGT 720 

TTTAGGATCG AATTGCTTAA TAGCATTTTT ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780 

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC CCAACTACCA TACATTGGGT TAGGGAAAAT 84 0 

GATATATTTC TTACCGAAAT CGTCTTTATG TTTTTCAATT AATGCTTCAC GAGATTCAGC 900 

TGTAGCTTCT TTTGGATCTG TAAAGTCTAA TAAATTATCT CCAAATAGCA TGACAAGTTT 960 

ATGATCCTTT TGAACCATTT GTCTGCGTGA TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

TAAAATATGA CTCTTCTTAG CTTGAGGGAT ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080 

CTTTAAATCT TTTTCTTTAT CTCTATCAGA AATATAGTAG ATATCGACAC CTTTTTTGTC 114 0 

AGCATATTTC AAGAATTCTT TTGCGCCATA GACAGGTTTA GCTTTAGCAG CTTGTACCCA 1200 

TTCATGCCAA CCTTCTGGGA AAGGTTTATT ATGTATTGAT GCATAGCCTT GATATGGAGA 12 60 

ATTATCTAAA ACTGTTTCAT CTAAATCCAA AGCAATAGCT AACTTATGTT TACCTTTATT 1320 

CTTTTTAATC TCTTTATCTA ACTGTGTCTT TGCACTGTTA TAACCTTGTA AATATAATGC 13 80 

TTTTGCTTCA GCTGAATTTT GATACCAAGC CACTGCCATA ATATTTTGAT TACCAAGATT 1440 

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500 

GGCAAACGCT GTAGAATTTG TCGTTTGTGG TGCTGAAACT GTAACCGCTA CCGATAATGA 1560 

TGCTATTGCA ATATACTTTG AAATTTTATT CATCTTATCA CCTCATGATT AATATTTAAA 162 0 

ATACAGTTAA AATTATAAAT GCATTTATTT AATATTGCTA TACTATGAAA AGATATTTAA 1680 

TATTATTTCT TGgAAAAGCT AACAAATATG TGAACATTTC ATATAAGCAT GATTTAATGG 1740 
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GTGACTAGGC CTTCCTATCA GACATATTCA 
GCTATCTCTT ATTTACCTAT TGGAACAACC 
5 ATAGCAGTCC TCACTCATAC AATTTCTCAT 

CTTACTTTCA CCTATTTTAA TTCATAAAAA 
GCAAGACGTT ATTATTGCAA TCTTTTTTAT 

10 

AAGATAAAGT AAGAACGTTT TTATTTTTCA 
TTAGGAAGGA AATTACATTG AAAAAACAAG 

)S TATTTTTTGG AGCCGGAAAT TTAATATTCC 

ATATGTGGAT TGGTATGCTA GGCTTTGCCC 
TTATTGTTGT TGCATTTTAT GATGAAGG TG 

20 GGTTCGGGTT TATTTTTGCT GTCGTGATTT 

CACGTGCTGC AAATGT CGCG TACGAAATTG 
AATGGACTTT AATTATATTC GCAGCAATCT 

25 

ATCCATCGAA AATCGTTGAT AATTTAGGTA 
TCGCTCTATT AAGTATTGCT GTCATTTTCA 
ATAAATATAT AACACATCCT TTCATTTCAG 

30 

TTGTTGCTGC GTTAG CTTTT TCCGTAGTCA 
CAGATCGCAT GAAAATTTTA AAATATGTCT 

35 TTGGAATGAT TTACTTTGCA CTTGCATACG 

AAGATGGTAC AGATATATTG ACGTACAACT 
TCGTATTTGG AATGACGGTT ATCCTTGCAT 

40 tTGCGCCACA TTTACTAAGA AACACGTACC 

TTTCtCTATC ATAGGGTTCT TATTTACAAC 
TGTCCCATTA TTGACTTTAA TATATCCCGT 

45 

TAACATGTTT AGCACATTCA G ATT CAGTTG 
GATTATTTCA ATTTTACAAA TACTAAATAG 
50 ATCGTTTATG ATGTTACCTT TAGCAGATAT 

CTTTGCTATT ATCGGTTTCA TAATCGATGT 
TTAATAAATG CTCACTGCCT AGTAATGATT 

55 
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CTCATCCACG TATCATTATG TGTACAGTGT i860 

ATAAACTCAT CCATAGTTTA CCTTTTATAA 192 0 

AAAAATCACA ACGCTCCAAC GTATTT CCAA 19 80 

CGACACTTTA ATTGTCATTA TCCAATAATA 2 04 0 

AAAATAATAG AATCATAGTA TTGTCATTTA 2100 

GATTTTTTAA ATTATTATGA ATATCTAGTT 2160 

TTATTATTTC GGGCCTCATG TTATTTTCAC 2220 

CGCCCATGCT TGGC CATACA GCGGGTCAAA 22 80 

TTACAGG CAT ATnACTCCCC TTTATTACTG 2 34 0 

TTGAAAGTGT AGGCAATCGT ATACATCCAT 24 00 

ACATGTCTAT CGGAGCATTT TACGGTATTC 24 60 

GTACAAGACA CATTTTACCT GTGCATAACC 2 520 

TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2 580 

AATTATTAAC ACCG TTATTA CTATTAATGG 264 0 

A CCCTGAATC TGCACTAAGT GCACCTAAGG 2700 

GAAGTTTGGA AGGCTATTTT ACAATGGATC 2760 

TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820 

G CTTTT CAGG TCTTATTGCA GCCATATTAC 2 880 

TTGGGGCATC AACAGCTCCA GGAAACTTTA 294 0 

CATTACGATT ATTTGGTTCG TTCGGTAACC 3000 

G CCT AACAAC ATGTATAGGA CTCGTcAATG 3 060 

TAAGTTTTCT TATAAAATAT TCGCACTTAT 312 0 

ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180 

GTCGATTGCA CTTGTACTCA TATCATTTGC 3 24 0 

GGCCTATCGA CTCGCAACTG TTATTACATT 3 300 

TTTCAACTTA TTACACGGTG TTATTTTGAA 3 360 

CGATTTAGCT TGGCTTGTAC CATTCATGCT 34 2 0 

ATTTATACGC CGTCCGAAAC AAG CGACAAC 34 80 

GACCCATCGT TACTAgGCTT TTt t ATATGA 3 54 0 
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 
(2) INFORMATION FOR SEQ ID NO: 32 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

r5 TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 60 

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120 
CAACACAATT TGTATATAAT AACCTGTCTG TAAG CGGATT TAAATATACT AATGATGGTA 180 

20 AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 24 0 

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300 
ATGGTAGCCC CCTAC CATT A ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 3 60 

25 

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 420 
TATTCAATAA AATTAATGAT CGATATATGT CAGAC GT AAA ACAAGATTTC ATTTTCCATA 4 80 

30 ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 540 
TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 6 00 

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 6 60 

35 CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720 

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 780 
TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840 
AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 9 00 

TATAATGATG GTCTTACTTT ATATAC T AAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

4s ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 10 80 

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140 

so AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 12 00 

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 12 60 

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 13 20 

55 
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TTTAAATTAA TTTTATGTAA TATAAATACT G CATTTG CAA ACTGTTGCAC TTTTAGGTAT 144 0 

AACAGAATTA ACTACATTTA AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 1500 

5 CTTGGTATTA GTTTAATCGC CATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 1560 

ATGACTGGCG GAAAGAAAGA TACATACTAT GGAATTATGA AAGATAATAC T A CTATTGAA 1620 

AAAATGATTA GTGAAAAAGA TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 16 80 

10 

GATGTTAAAG TTAAAAAAGG TGATTTTGTA ATTGTTTATA AATTAGCAGA TTCAGATAAA 174 0 

ATTGTTAAAG TTAAAAAAGT TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 1800 

1S CATGACATGG GCAAAATGCA CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 18 60 

GCCATCAAGG TATATACGAG TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 1920 

TACTGCTGCT TTTTATATTT GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTCGCC 198 0 

20 TgcgCTTTGC GCATGCATAA AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 2 040 

TGcgCTTTAG CGCATGCATA AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 2100 

CTGCGCTTTA GCGCATGCAT AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 2160 

25 

CCTGCGCTTT AnCATGGCCA TAAAAGC 2187 

(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 
30 <A) LENGTH: 178 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

CCnCCTTTTT AAACCTGGnG AAATGt mCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60 

40 GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 120 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 24 0 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAACCATAAC 300 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 360 

SO AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTA CTAA CAT TAACAAAATC 420 

GATAGCTATT ACATT AAA TG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 4 80 

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 54 0 
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 66 0 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720 

5 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 78 0 

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 84 0 

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900 

W 

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 96 0 

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 102 0 

15 TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 108 0 

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 1140 

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 1200 

20 GTAAATCATA AATA CAT ATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 126 0 

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 132 0 

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 1380 

25 

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 144 0 

GAG GAAT AAA ACgTGACATA TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAG TAAAAG A 1500 

30 ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560 

CTTG CAG AAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 162 0 

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 16 80 

35 CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 1740 

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 1788 
(2) INFORMATION FOR SEQ ID NO: 331: 

40 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
4S (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

50 TAAGCCAAAT CCATTGTTAC CAG AAG AAG T TCGCTTCATC TTAAAACAAA TGGGTCTTAA 60 

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 12 0 

ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 180 

55 
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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGT C ACGA 3 60 

5 AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 42 0 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 4 80 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 54 0 

10 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGCAGGT TTAATTATTC CGGTAGAAAT 6 00 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 6 60 

, 5 ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 72 0 

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 78 0 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 84 0 

20 TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAG CAGATAA 900 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 96 0 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATG TGGG A 1020 

25 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 10 8 0 

TCAAGAAGCA G CT AGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 114 0 

3Q ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTC C GTTT 12 0 0 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 126 0 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 132 0 

35 AGCAGCAGGT GCTGACTATG T 1341 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

so ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATA CTT TkACTTCATC 12 0 

ACCAGCAACA ATTTTATCGT ATTTTT G CTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 18 0 

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 24 0 
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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCT CTATT AGAGACTTTT CATAATTTTG 36 0 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 480 

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 54 0 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 

CCAATTTTTC TAACATGATC TAAAATGCTC ATTG CTAATG CAGCACCTTC ACTCGGATCT 78 0 

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 84 0 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 90 0 

TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 96 0 

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 1080 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 114 0 

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 12 00 

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 12 60 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 138 0 

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 144 0 

ACAGG TATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 156 0 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 1680 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 174 0 

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800 

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 18 60 

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920 

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980 

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 204 0 
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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATCCTTTTTG CGCAGTTGtA 2160 

CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220 

5 

TCACATTTGG ATATTTACGA A CG ATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 22 80 

TAATTTCTAA TACTGTATTC GTTT CTTCAG CTAATG C CAT TAATTGTTCA ATATTCGGTT 2 34 0 

w TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 24 00 

GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTG TTC TTCTGATTGG TTAAAGCTTT 24 6 0 

GATGAATAGC TCCAATTACA TAATCAAGTT GTG CT AAAAT TTCATCATCA TAATCCAGCG 2 520 

'5 AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2 5 80 

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2 64 0 

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2 7 00 

20 

TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATCACT ATACGTTGTA TGCATATGAA 2760 

TAT CACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2B20 

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2 8 80 

25 

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTT 2 94 0 

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3 0 00 

30 GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 30 6 0 

CATCATCATA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTT TTACTT 312 0 

TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 3180 

35 AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 324 0 

CGATATAATT T AATGTAT CT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 33 00 

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 33 60 

40 

CCGTTTTCTT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 34 2 0 

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAG CTT AGCAATTTTC TTGCTTCCAA 34 8 0 

45 GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 354 0 

AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 36 00 

TGCCTTTtAA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATTCAAGAC 36 6 0 

so TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 372 0 

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGT CATAATA ACACTCCATA 378 0 

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3 84 0 
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TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3 96 0 

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 4 020 

5 

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4 080 

TGAACTTACA AAACAATGTA AT CATTAAAA AAGCTACAAT CGCTTCAAAT CGATTTTGTA 414 0 

GATGATTAAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTAG GA TATGGAATAA 42 0 0 

W 

ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 426 0 

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTGAATCCA ACGATGACAA 43 20 

75 AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CGCTGCTGCA 4 3 80 

ATTTGTGAAT TTGTTGTTTC AAACGTCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4440 

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TG CTTTATAC 4 500 

20 CCTAATTCTT TTATTTTATC ATCAACTAAA TGTG CTACAT ATCGTATGTG CTCTGGGTTA 4 56 0 

TCTTCCCCAA CAATTGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 4 62 0 

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4 680 

25 

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4 74 0 

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTAAAGAAT 4 800 

30 TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4860 

TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4920 

GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4980 

35 TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 504 0 

CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACGTG 5100 

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 513 6 

40 

(2) INFORMATION FOR SEQ ID NO : 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4239 base pairs 

(B) TYPE : nucleic acid 

4S (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 60 

ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 
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GGTATCTTTC AAAGATAAAA TCTTAATAAT TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 24 0 

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAAAAA 3 00 

AC CTTTATAA AAATTCATAC CTTTATG CTA TCGCTGTAGG CTCATTAACT TGTTACATAT 360 
AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 42 0 

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 4 80 

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 54 0 

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 6 00 

CTAATATTAA TTCACTTAAA TTATCTT CGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 6 GO 

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720 
CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 7 80 

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 84 0 

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900 
CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAA TCGTTGATCT TGTAATTCTT 9 SO 

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 102 0 

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080 

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 114 0 

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 1200 

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCG CAT CA TCATCGCCAA 12 60 

ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320 

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 13 80 

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 14 40 

CTAGACTAAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500 

ATCCTGCAAT AACTTCAG CA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 156 0 

CATTCTTCCA TTCATTTTTA G CTTCTT CAT ATTG CTTTTC AAGTTTTGTT TGTTTATCAC 162 0 

GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATGTACTGC GGCATCTTTT TCATTTTTAA 16 8 0 

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 174 0 

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 18 0 0 

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 18 6 0 

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 1920 
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TTTTGCGATA TTCATCTAAT GTAGTAGCAC CAATACATTG TAATTCACCA CGTGCTAATG 204 0 

CCGG CTTCAA AATATTCGAA G CAT CG ATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 210 0 

5 TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 216 0 

TTTT CAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 222 0 

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 22B0 

to 

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 234 0 

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 24 00 

^ TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 246 0 

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2520 

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 258 0 

20 CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2 64 0 

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 2700 

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 2 76 0 

25 CTTCAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2 820 

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 28 8 0 

GAATTGCTTC TTCTTGGGCA TGTGCTAATA CGCGCTGTGC ACGCTCAGTT AATCTACCAA 294 0 

30 

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3 000 

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3 06 0 

35 TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120 

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180 

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 324 0 

40 GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3 300 

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGT CCATAA ACTTGCGAAC 3 360 

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 3420 

45 TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAG CAT CACGCTTGCT CTCATACCAG 3 48 0 

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 3 54 0 

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3 6 00 

so 

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3 660 

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 372 0 
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CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3 84 0 

GCATACATTA GTGGATGCAC ATGATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAGAC 3 9 00 

ATAA CAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC ATGAATATTA 3 9 60 

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4 020 

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 4080 

GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 414 0 

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 4 2 00 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4 23 9 
(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

CAAAACTTTG ATTATGTCGT GTCAGTTATT AATGCATACA GTAATGCATC GTTGTTATTT 60 

TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 120 

CTGATTTTCT AAATATAAAA TGCCTCCTAA TAACATACTA TTAGTACATC ATTAAGAGGC 180 

TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTTAAATTT AAGATGAAGA TTCTTGCAAT 24 0 

AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 3 00 

ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 3 60 

ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 420 

GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 4 80 

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 54 0 

AACC CACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTT CAATTG TTCGCTATAT 6 00 

ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 6 60 

TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 720 

CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 780 

AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA TACATTTTTC GAGCACACGA 84 0 

TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900 
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CCGTTATAAC CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020 

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080 

5 GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 114 0 

CCTTAAAAAC TTCACATGTG ATTGTkTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200 

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 124 5 

w 

(2) INFORMATION FOR SEQ ID NO: 3 3 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1399 base pairs 
, s (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 60 

GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 12 0 

25 

ACTTTTTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAATAACATA TCCCAATTTG 180 

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 24 0 

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 3 00 

30 

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 3 SO 

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 420 

3S TTCTGaAAAT TCAGaTATAA AT CATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 480 

GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 54 0 

CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600 

40 CAACTGCACC GGCGTTTTTA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 6 60 

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 720 

CTAATATTTG TATAG CATGT TGATGATATT TACTTTGTTT AATATCATTT TCAC CAGGAG 780 

45 

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCGTAT 84 0 

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 90 0 

CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 960 

50 

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 1020 

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 10 80 
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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 120 0 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 126 0 

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1380 

CACCATATAT GAAAGCTCC 13 99 



(2) INFORMATION FOR SEQ ID NO: 33S: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

,s (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

TATAGTTUTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60 

CGTATTAAAT TGAATGGTAG GACCAAT CAA AGTATATTCA ATCGCTGGAC CATCATTGTT 12 0 

25 AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180 

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 24 0 

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 30 0 

30 

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 36 0 

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420 

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 4 80 

35 

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 54 0 

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600 

40 AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 660 

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 72 0 

TGGTTTGATT CACAGCATGA ACATTTTTTT CATTAAATTT GTTAAGTTCT AAATTTTCAA 780 

45 TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 84 0 

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 90 0 

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 96 0 

SO 

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 102 0 

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCG CAT ATA 10 80 

ss 
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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT 12 00 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 12 6 0 

s ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 132 0 

ATTTTTACG 13 2 9 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60 

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 12 0 

GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA 180 

TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 24 0 

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 30 0 

AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 

AAATC CTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 

ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 480 

CATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 54 0 

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 

CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 66 0 

TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 72 0 

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 780 

ATGGATTATT CACAT CTTG A AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 84 0 

TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 90 0 

GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 96 0 

ATTtATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGAC TAAAAT 102 0 

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 108 0 

AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 114 0 
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TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TACGTACGTC 1260 

ATTATCATAT ATTG CACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 132 0 

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 13 80 

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 144 0 

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500 

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 1560 

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 162 0 

CATGGTGGTG CAAACGAACA AGTTATGACG ATGTTAT CTG a GATTGGGTC AaTTGAAAAT 16 8 0 

GTTGATGCTT ACTTAGATGA AAAATTTGCT AATAAAG r T A AAGTAATGGG cTTCGGTCAT 174 0 

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 18 0 0 

mCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860 

GCAGAAGAAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGCGAGTGT TTATCACTGT 1920 

ATGGAAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 198 0 

ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 2 04 0 

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA GAAAmTAATC AATACAAATT 2100 

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AATTACTCAA 2160 

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT CGGTGATGGA 2220 

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGCTGCtGT TGAGAAAsCC 228 0 

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AG CATTTGAT 234 0 

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 24 00 

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TGTGGCTTTA 2460 

CGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520 

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 258 0 

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 264 0 

TTACAAAACG AAATGGGTGC G A CAAACATT CGATTCCCAG AAACTTCAGG TATTGGTATT 2700 

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 276 0 

GATAATAACC GTAAATCAGT TACTTTAGTT CATAAAGGTA ATATTATGAA ATTTACAGAA 2820 

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 28 80 

ACTTGGCAAC AATATGACGA AATTGTTGAA AATGAAGGCA GAGATGCTGC TAATGCTGCT 294 0 
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TTACAACAAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAGCAACTAT GAACTTGAAT 3 0 60 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180 

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 3 24 0 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 33 00 

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3 3 60 

TCTACATCAG CATTTG CAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 34 20 

G 3421 
(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 
CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

TTAATTGGAT TAAt ACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 24 0 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 3 00 

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 3 60 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 4 20 

TCGTC CACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 4 80 

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 54 0 

ACGCAAAGTT CTCCTTTGAC AT ACGGATG C GCATTGATAC AACT AG CAAT TGCTAAGGCA 6 00 

TCAGTCACTC TTTCATTAAG ATCAC CTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 6 60 

GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 72 0 

ATAGCTCCTG TATAAAC CGT TT CATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780 

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 84 0 

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 
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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA CCACTAATAT GAACAT CTTG ATTGCTAGAA 1020 

CGCATTTTTA TACTATACAT GACGATCACC TcAATCTTCT TGaTGCAAAA TTTCAAACAA 10 80 

CCTATCTATA TCTTGTTCAG TATGAAAATA C GACAAT GAT ATTCTTAACA TTGGCTTAGT 114 0 

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 12 00 

GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 12 60 

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 1320 

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 13 8 0 

ATAAATTGGC AAACTACTTG AGTAGATGAG TGGTCTACCG TGATTAATTA aCATATCCtT 144 0 

CaCATCaTTT GaACTkaAAA TcACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 1500 

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TAT GTGAAA C 156 0 

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAAT GAG ACTAATTGAC CAATATCCAC 1620 

AACGTCGCCA TTCGTTGAAA ATACACTATC AGATATGATT ATTTTTGGTA TATTTTGATT 16 80 

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 174 0 

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTC GCA TGATTTTCTT GATCTGAAAA 18 0 0 

CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 18 60 

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 1920 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGT GATAAC TATACTTCCG 1980 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2 04 0 

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 210 0 

TACCGATTTA AGCGACCGAT ATAACCC TTT CTGTTGTAAT AAATCAATTT GCTCTTGAAA 216 0 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 222 0 

TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 22 80 

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 234 0 

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 24 0 0 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 24 60 

AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2520 

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGC CAG AA CATGGTG A TATATTATTC 2 58 0 

GCTTTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2 64 0 

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 270 0 
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TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820 

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2 880 

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2 94 0 

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAG CATT TAAAATCATG 3000 

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3 060 

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120 

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173 
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1694 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60 

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120 

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180 

GATAATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 24 0 

GTTCTCAACA ATACATTATT TATAT CTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 3 00 

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA AT GGGATT AA 360 

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 42 0 

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 480 

AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 54 0 

TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600 

TGCTGTTTCA TAAATCGGAG AAATA t TAGA TACGTTAATA CCATCATATT CATTCaAAAT 660 

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720 

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 780 

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 84 0 

TTTATACGAT TTGCAATACG TTCAG CTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900 

ATAATTGATT TAACCTCTTc GAACACTTCA CCATAATGAA CTGTAT CAAT AACATTATCA 960 
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ATTT CATTTT 


CAGCTGATAA 


AGCACCATGA 


TATCCATAAA 


AGCGCATACC 


TTTAAGAAAG 


1080 


ATTGTGTCTT 


G CATTTT CAT 


TCTCCTTTAA 


AAAATCTATA 


CCTTTAGCTA 


ATTTAGCATT 


1140 


CAACTCGACA 


TTATGAACGC 


GTACTGCTCT 


AACGCCTTTC 


ATAATACCAT 


ATGCAGTCGT 


1200 


AGCTGCAGTT 


ACTTCATCTC 


TTTCAACCGG 


TGTTGTATCA 


TAACCCATCA 


TCTCTTTAGT 


1260 


GAAACGTTTC 


CGGCTTGTCG 


CTAATAAAAC 


TGGATATTCT 


GTTGCAACAA 


GTTCATCCAG 


1320 


TCTTGCCATA 


ACTTCGGCTT 


CTTCATTTCT 


AGTTTTAGCG 


AAACCTATAC 


C TGGATCT AG 


1380 


CCAAATTTTA 


TTTGAAGGTA 


TACCAGCTAT 


TTTAGCTTGA 


TGTGCTTGTG 


CTAACAAAGA 


1440 


TGTTAACATT 


TCTTCGACAA 


CCGGTTCATC 


ACGATTACCA TTTCCATTAT 


GCATTAAAAT 


1500 


AATTTCCGCG 


TCATAtTTAG 


CTACAATTTG 


GAACATACGA 


TGATCATACA 


GACCgCCcAT 


1560 


t GATC ATTAA 


TCATATCAAC 


GGCTAATTTT 


AAACATGCTT 


CAGCAACCTC 


ACTTCGAAAT 


1620 


GTATCGACTG 


AAATTTTTTA 


CATCAnAACC 


GACAATAGCT 


TCAnCAACAG 


TAATACTCTG 


1680 


TTCCATCTCT 


TCTG 










1694 



(2} INFORMATION FOR SEQ ID NO: 34 0: 

2S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60 

35 

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 120 
TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 180 

40 CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TGCATATTTA 24 0 

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAG C 300 
AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 36 0 

45 AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT TGACTTTGTT 4 20 

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 4 80 

GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 54 0 

50 

ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 600 
CCAACTTTTT CAGCCCAATA ACTTA CATTT TTCTTCATAT CATCTTGTTC AACACAATTA 660 
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TCTTGTAAAA CTGTACCAAC AACATTAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

T GATTATTTA TATATAGTTC CCCAGTTATG TTACGTTTAG TTTTAAATGG AATTAATCCG 84 0 

S TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG CACCAACTAC TAATACTTTT 90 0 

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 1140 

1S ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTG AC C 126 0 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

20 TCCAATCCAG CCATATAAAC AGCCTATAAT TCCCGAGC 1358 

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4557 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 1: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 120 

35 

CACATATGAC ATTGAGAn GG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 24 0 

40 CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTTTGTCA TTACAGATAT 3 00 

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT CAGAATATGG 360 

TATTTCATTA ATCTTTGTTG AAGATG t GAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 420 

45 TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 4 80 

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 54 0 

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600 

SO 

AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 660 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 720 
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AGGGAAATCT GAGATTATCC AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 84 0 

TGAAGTTG C A TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA 9 00 

5 TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATGGC GATGAAGCGA TGCGTGCCTT 96 0 

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 1020 

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 10 80 

10 

TTTATT CATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATTTTATGAA 1140 

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 12 00 

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTAAGTTGGC 12 60 

IS 

ATTAAAAGTA CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAACACCAG ATGCAGCAGA 13 20 

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAa T AATGAmATTT ATGAATTATt 13 80 

z0 CCAATCTGCA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAG TTGAAGA 14 4 0 

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATCAACA AAGACTTGAG 15 00 

TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 1560 

25 TATCGAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC 16 20 

ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATTATGGTC 16 80 

AGATGATGCA AAAGAAGTGG AATTAACATT AG GACTTAAA GACGTACCAG AAGAACAATA 174 0 

30 TCAAGGACCG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGTTAA TCGGAAGTCC 18 00 

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 1860 

TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 192 0 

35 

AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA GAAGACAAGA TTGCTAAGGC 19 80 

GATACGTATA TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 204 0 

4o CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 2100 

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 2160 

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 222 0 

45 AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 22 8 0 

ATATGATAAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 234 0 

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 24 00 

50 TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAC 24 6 0 

AGAATTTTAT AAAGGTGAAc AC CAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2 52 0 
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GGATTAGATT ATGAAGGTGT TACACTACAA AAAATTAAAT TAACTGAACC AG CAATGATT 26 4 0 

TCATCAGAAA ATCCGAGAGA AATTG CGCAT ATTGCTGAAA TTATGATGAA AGAAATTGAC 2700 

5 ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT TAAAGCTTAT 276 0 

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 282 0 

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880 

w 

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 2 94 0 

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3 00 0 

GAATTAAt AG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3 06 0 

15 

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACGAGAA 312 0 

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180 

20 TGGTTTAAAT AG CAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 3 24 0 

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3300 

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 33 6 0 

25 ATTTGAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3 42 0 

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 34 8 0 

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 3 54 0 

30 AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TATTTTAATA 3 600 

AAGCAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 366 0 

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3 72 0 

35 

GTCAAAAGCA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATT GG GAAGGACAGT 3 78 0 

TCGC CAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3 84 0 

^ AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3900 

ATACATTATC AATTAAGCAA GGGCTTGATA GGGTGAACCC ATGATGAAAG ATGTTAAGCG 396 0 

AATAGATTAT TTTTCTTACG AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4 02 0 

45 AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4 080 

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTG G CTACATTAGT 414 0 

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 4200 

50 TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4260 

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATTGAT 4320 

55 
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTGAAA TATACAATCA 444 0 

TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 4 500 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4 557 
(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCGAAGTATT TGATGTTTTA 60 

CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120 

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 180 

CCATCACAAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 24 0 

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 3 00 

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 360 

GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 420 

GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 480 

GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 54 0 

TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 600 

GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGC GGT AACAACGCCG 66 0 

TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 720 

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA AAATTAACAC 780 

GTGGTTTAGG AG CAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 84 0 

AACAAATTGA AGATG CAATC CAAGGTG CAG ACATGGTATT TGTTACTTCT GGTATGGGTG 900 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGGCGCAT 96 0 

TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 1020 

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATAC CAAATG 10 8 0 

ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 114 0 

ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 12 00 
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GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320 

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 13 80 

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440 

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500 

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560 

GTAGCACTGG ATTCGGAACA AG CGTAAAT A CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620 

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 16 80 

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 174 0 

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800 

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 18 60 

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920 

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 19 80 

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 2 04 0 

CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100 

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160 

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220 

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 22 8 0 

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 234 0 

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 24 00 

TATAGTACAA AACAT CATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 24 60 

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2 520 

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2 5 80 

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 264 0 

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700 

TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 276 0 

AAAGGTCAAA CAGGACG CAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2 820 

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2 880 

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2 94 0 

GAGCTAAAGA AG CGTATGAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000 
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATC GAT 3120 

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180 

5 TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGAT GTT GAT CAGTTT A 3 24 0 

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3 3 00 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 33 60 

10 

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTAT CAATG GGAATGAGTA 3 420 

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 3 4 80 

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3 54 0 

75 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3 600 

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3 6 60 

20 TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3 720 

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3 7 80 

AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTG CTGA 3 84 0 

25 TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3 900 

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3 931 

(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3150 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

40 AATTGTCGGG GGACT CTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

45 ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 24 0 

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC T AATTGG TAG 3 00 

TTGGATTGCA TTAGTATTAT ATGCTG TTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 36 0 

50 GCCAGTAATG GATAAAATTA ATCAAC CAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 420 

TGGCATTCCG TTTTGG GG AT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 4 80 

55 
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AGGTATTAGA 


TATTACCTTT 


TCGAAAAACG 


ATACCATCCA 


TTTATTGCAG 


CAATTGTAAT 


600 


TGGACTTATC 


GCACTCTTAG 


CTTGGCCAAT 


GAGTGCATCA 


ACTGGAAGAA 


ATGACGGTTT 


660 


AGGTATAACA ACGCCTTCAG 


CAAATTTAGT 


ACACTTTTTG 


ATTACAGGTG 


AAACTAAATT 


720 


TATTGATTGG 


GGTGTCTTTT 


TAGTTCTAGG 


AATTTTCATT 


GGTTCATATA 


TTGCAGCTAG 


780 


AGGATCAAGA 


GAATTTAAAT 


GGCGATTGCC 


AGACAAGATT 


ACAATACGAA 


ACAGTGCCAT 


840 


TGGTGGCATA 


TGTATGGGAT 


TTGGTGCGTC 


AGTTGCTGGT 


GGTTGTTCTA 


TCGGTAACGG 


900 


TTTGGTTGAA ACGG CAACGA 


TGACTTGGCA AGGATGGATT 


G CGCTAg CAT 


gCGATGATAG 


960 


TTGGTGTATG 


GACAATGAGT 


CATTTTATCT 


TTGTTCGTCC 


AATGAAAAAA 


GTACAC CAAC 


1020 


AATCTGCAAA 


GGTTAAACAG 


CAAACGCAAA 


TAGTATAGAA 


GATTATTATG 


CAAATGATGT 


1080 


TGATCAAATA 


AAAGTGATTG 


GAAAAGGAGA 


AATAATTATG 


ATACACGAAT 


TAGGTACAGT 


1140 


AGGAATGGTA 


TGTCCATTTC 


CGTTAATTGA 


AGCGCAAAAG 


AAAATGGCAA 


CATTGCAATC 


1200 


TGGAGATGAA 


TTAAAAATTG 


ATTTTGATTG 


CACGCAAGCG 


ACGGAAG C CA 


TTCCAAATTG 


1260 


GGCTGCAGAA 


AATGGTTATC 


CTGTAACAAA 


CTATGAACAA 


ATTGATAATG 


CTTCATGGAC 


1320 


AATTACAATT 


CAAAAAGTTT 


AACGTTATCA 


TTTTAACAAT 


AAAATAGATA 


TTAGATTCTA 


1380 


TGGCTACTTC 


CGCTAATTTA 


AAAGTGAGTA 


AGTAGTCTTT 


TTTTTTTTAG 


TTCATGAAAT 


1440 


CATTTTTATA 


TAGTGTGGCA 


CATTTTATTC 


CAAAAGATGT 


AATAAAACTT 


AACGCATTTT 


1500 


TGCTTTTTAT 


AAATTGTCAG 


ATTATTATGA 


AAAAAAGGGA 


GTGGTAAGTA 


TGAATCTTAA 


1560 


CGATACGATA 


TTTATGTTTT 


TGTGTACATT 


ATTAGTTTGG 


TTAATGACAC 


CAGGATTAAG 


1620 


TTTATTTTAT 


GGTGGGTTAG 


TTCAATCTAA 


AAATGCGCTT 


AATACTGTCA 


TGCAAAGTAT 


1680 


GGCAGCAATT 


GTGCTTGTTA 


CATTTGTATG 


GATAACAGTT 


GGTTTTACAA 


TTAGTTTTGG 


1740 


GAATGGGAAT 


TTATGGTTCG 


GAAATTGGGA 


ATATACTTTT 


CTTAATCATG 


TAGGTTTTGC 


1800 


GACTCAAGAA 


GATATTAGCC 


CACATATTCC 


TTTCGCTTTG 


TTTATGTTAT 


TTCAAATGAT 


1860 


GTTTTGTACG ATTGCAATTT 


CTATTTTATC 


TGGTTCAATC 


GCTGAGAAAA 


TGAAGTTTAT 


1920 


TCCTTATTTA 


TTATTCGTAG 


TAATATGGAC 


TGCTCTTGTA 


TACAGTCCAG 


TAGCACATTG 


1980 


GGTTTGGGGC 


GGCGGTTGGA 


TTAACAAACT 


CGGTGTATTA 


GATTTCGCTG 


GAGGTACGGT 


2040 


TGTTCATATT ACATCAGGTG 


TTTCTGGTTT 


AGTATTAGCT 


ATTATGATTG 


GAAAAGGAAA 


2100 


CAAACATTCT 


GAATCAACAC 


CACATAATCT 


TATCATTACG 


TTGA1TGGCG 


GTATATTCGT 


2160 


GTGGATTGGT 


TGGTATGGAT 


TTAATGTAGG 


TAGTGCTTTT 


ACATTTGATA 


ATATT G CG AT 


2220 


GCTTGCATTT 


ACAAATACTG 


TCATTTCAGC 


CAGTGCAGGT 


GCTATAGGTT 


GGTTAATTTT 


2280 
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ATTAG TTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 24 0 0 

TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAATA 24 6 0 

TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2 52 0 

AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT TTATTTATAC 2 58 0 

TGGTGACATA CATATTATAC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 264 0 

TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 270 0 

GGAACAAGAA ACGAATATAG GATTAGACAA GATTGTTCaC GGTGAACATG CTTACTTTGA 2760 

AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 2 82 0 

TCATTGTCCT GACATTTAAC TAAAGG TTG A TGTTGGGACA TTTTGTTATA CAAAAGTTTT 2880 

ATTTTGAAAT C TTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 294 0 

AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTGAATAA GTATGAAAAA TATATCTGAT 3 00 0 

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 3060 

GTCAGTAAAA AAA CAAGTGA AAAATTAACA AGAATTATAG CAGAACATGA CTATCAACCG 312 0 

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 3150 
(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 

ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 120 

CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 18 0 

ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 24 0 

TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 300 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 3 60 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 42 0 

TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 4 80 

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 54 0 
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ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 6 60 

AAGAGGTTTT TCTATTTCGC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720 
CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 7 80 

GATC CAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TG CGACATTA 84 0 

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900 
AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960 

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 1020 

GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCGTA TATTTATGCT 1080 

GATGGAAAAA TGGTAAACGA AG CTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 114 0 

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATT CAG GTCAATAATG CTCATTGTAA 126 0 

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 132 0 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 13 80 

AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 14 4 0 

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 150 0 

AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560 

TGTTACAAAA TATTTAGTGC AAAAGCACGA CGGAGGTATT CAATATGaAT AACGGTACAG 1620 

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 16 8 0 

TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 174 0 

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 1800 

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC 18 60 

GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920 

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 198 0 

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 2 04 0 

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100 

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2220 

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 228 0 

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 234 0 
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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 24 6 0 

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 252 0 

5 TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2 58 0 

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTG TAAAT 264 0 

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GG CAGTTAGT 27 0 0 

'° TAAGAAGTCT TGATAATCAC GATCG CAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 276 0 

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 282 0 

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 28 8 0 

15 

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 294 0 

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 300 0 

20 TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3 06 0 

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 312 0 

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG TTTTTCTTAA CCCATTCAGC 318 0 

25 CGCTTTTTTA CCTGCACCAG GATAGCTGCA AC GATAAACT TTCATCCAAT CATTTTTGCC 32 4 0 

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 33 0 0 

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA CAATGACCAA AAAATCscCA 33 60 

30 CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 34 2 0 

AATTTCTGAA TACTTATTAT CTG CAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 34 8 0 

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3 54 0 

35 

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 36 0 0 

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660 

4Q TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719 

(2) INFORMATION FOR SEQ ID NO: 34 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA AC TGAACGCA 6 0 

55 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 180 
ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 24 0 

CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 3 00 

ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 3G0 
TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 420 
ATTTTC CTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 4 80 

ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AG CTGATG CA ATTTGTCTGT 54 0 

CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 600 
TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TT CGATAT AT ACGAAGGTCT 660 
TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 720 
ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGT CTTG A 780 

TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 84 0 
ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 900 
ACTAAATTAC GTGATC CTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 96 0 

TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 1020 

TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 10 80 

TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 114 0 

TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 1200 

CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 12 60 

TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 13 20 

ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 13 80 

CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 144 0 

TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGAT CAA TCACACCAGC CTTGACATAA 15 00 

GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 1560 

TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTG CAGTT 1620 

CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 16 76 
(2) INFORMATION FOR SEQ ID NO : 34 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

TACAGTAGGA ATCATAAAAC C TAATA CAAC AAATACAAAA CCATTTAAGG CAT AAC T AAA 60 

TGTGTTCCAA ATTT GATGG T AATT CATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120 

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180 

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 24 0 

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 3 00 

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG AT AAATT G TT CAACAGCTTG 360 

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 42 0 

TGCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAG GTAATAA 4 80 

TTTTCCGCGT GTAATAG CAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGCTAT 54 0 

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600 

GACTACAGTT G CAAACACT A GTGCCATTGA CATTAATAGT AT AG GTTTAC GATATTCTAA 66 0 

TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 720 

CATAAATACT T CAG AATT G A ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780 

ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT T GAT AAC C G A 840 

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTG CCA T ACAAT ACT C 90 0 

CTCAAAATTT TAATAGTTAA T ATTTT AT CA CTTTTAAGGC ATAATGACAT AGATATATTG 960 

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020 

TGTTTAAATT ATTGATTGTc ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 10 80 

GCGAGCTTTT CAGCATT CGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 114 0 

AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 12 00 

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 12 60 

CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 1294 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60 

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120 

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180 

TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 24 0 

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 36 0 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTG CATTTTA AATTATGTTT 42 0 

AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAG CTTAATG 4 80 

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAG CAG ATATTGTTGA ACTACAAACT 54 0 

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600 

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 66 0 

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720 

GATAAT TGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGG CCGA 78 0 

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 84 0 

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTG CTGG CGAAGAGAAA 90 0 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 96 0 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 102 0 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080 

ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATT CAG A GCTTAAACAA 114 0 

CAAGATACTA AACATGAATT AG ATGTTG CG CCTATGTTCA AATCATTGAT TGGAAAAGAA 120 0 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 126 0 

GGCGGcAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 132 0 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 13 80 

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 144 0 

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560 

GACAATGTTG ATTTAGCCAT TTTTGG AC CA GGTAATCCAT TAATGGCACA TCAAATCGAT 162 0 

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 16 8 0 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 174 0 
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TCAAATATCA ACAAGCACAT TTT CATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 1860 

GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 1920 

AAACTCGTAA AAnCC 193 5 
(2) INFORMATION FOR SEQ ID NO: 34 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 8: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 6 0 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 120 

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 24 0 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 30 0 

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTG CAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480 

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 54 0 

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 6 00 

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGG TAT TGCCGATGAT 660 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTG CCGATCA 780 

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 84 0 

AATTCATGAA GGTGTTGGTG AGGTCGTATT TG ACAG T AAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 960 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020 

GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 1080 

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 1140 

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200 
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320 
ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 13 51 

5 

(2) INFORMATION FOR SEQ ID NO: 34 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

'5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60 

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 12 0 

20 ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180 

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 240 

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300 

25 

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360 

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411 
30 (2) INFORMATION FOR SEQ ID NO: 3 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
35 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

TCATTTTCAT AGG TT ATT AC GCAGATCAGC ATAATAATCC ATTCCATATG AGTc CTTATT 60 

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 120 

45 CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180 

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 240 

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300 

50 GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 360 

ATGAGCCCGT TT CATTAGAG ACATTTGCAG AAATGTATGA TGAAC CT AAA GGCTTTGGTG 420 

CATTATTGGC AT CAATGT AC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATk 4 80 
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TTAATAATAA AGGAGCGTTA TAG TG AATAT CATCTCAACA ATTt TAATCA TATTTGTGGC 6 00 

ATTAGAGTTT TTCTATATTA TGTACCTTGA AACGATTGCT ACAACTTCCA AAAAGACTAG 66 0 

CGAGACATTT AATATAAGCG TCGATAAATT GAAAGACAAA AATATTAACC TACTTTTGAA 72 0 

GAACCAAGGC GTATATAACG GTTTAATCGG AGTTTTGCTA ATATACGGTT TGTTTATCAG 7 80 

CAGTAATCCA AAAGAAATAT GCGCAGCTAT TTTAGTGTAT ATCATTGGCG TTGCTATTTA 84 0 

TGGTGGCCTT TCAAGCAATA TTAGTATCTT TTTCAAACAA GGCACATTGC CAGTATTGGC 900 

ACTCATATCA ATGCTTTGGT AAGTATTGGT GTTTGGGGGG GTGGAGATGT AGTCGGAGGT 96 0 

TTGGAGGATT TGAGCGAATT GTGTGTGGAC TTTAGACTCA GAGTATTTCA TCCTAATTAT 102 0 

TTCAAGCAGA GGTGACAGTA GCGTTGCCTC TGTTTCCTTA TAAAAAAATT ATTT AA TG AA 108 0 

GAAAACCCAT ATCTGATTTA ATTTTCAGCT GATAAATACT CCATATATTA GAATGGCTAC 114 0 

TTTATCTATT GCATCAATCC TTTAAAACAA AAAACCCATG ATTTCGAAAT TCCCGTATGA 1200 

TGGGGTTCCT ACTCTCATGG ATCAGTTAAA TAAATATTAT CACTATCAGT TTATTATTTC 126 0 

AATATTATTA ACAATATATG TAGTCGTAAA AGGAAAGAGG ACATGAGAAC TTCGGTGTTG 1320 

ATTGG CATT A CATAACG CTT CCAAACATAT TATTTGGTAA CAATAAGAAA CTATTTACAC 13 80 

AATATATTTT GTATAGTAAA ATTATTTTAT AAT ATTT AAA TCCAATTGCA CAAGGAGTGA 144 0 

TTATCaTGGT ACCAGAAGAA AAAGGTTCTA TTACTTTGTC AAAAGAAGCA GCTATCATAT 1500 

TTGCAATCGC AAAATTCAAA CCATTTAAGA ACAGAATTAA AAATAACCCA CAAAAAACAA 156 0 

ATCCATTTCT TAAATTACAT GAAAACaAAA AATCTTAATC ACTTT t ATTT ATAGcATTTC 16 20 

TAATCTCAGA AATGCTATA 16 3 9 
(2) INFORMATION FOR SEQ ID NO: 3 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 60 

CCAAGCGCAC C CATT AAT AA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 120 

TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 180 

AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 24 0 
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TTCTyTATTT 


GTAaTAACGC 


TGCAAACATA 


TCCATAACCC 


CGCTTCTTAG 


AGCCCCTTTT 


360 




AATTnATnAA 


TTAGGGGCTC 


TTATGCAGTT 


GGTG CATTAG 


CAACCAACTG 


TATTCCTTTG 


420 


£ 


TCCCCTTTTA 


ATTTATTaAT 


TAGGGGCTCT 


TTTGCTGTTG 


GTG CATTAG C 


AACCAACTAC 


480 




GTTCAATTTA 


ACCGAATAGT 


TTAAAATTAA 


ATACAAACCT 


TAAATTAGTC 


TAAAACTACG 


540 




CCTTTGGTTG 


TTCAACAAAG 


CTCGCCATGA 


GATTTACAAA AGAATCAACT 


TGTGGCAATT 


600 




GCAACATGCT 


CGGATCATAA 


CTCATAAATG 


TCGAACGAAT 


CAGCGGTTCA 


TT AT CAATTT 


660 




CTACTTTTTC 


AAACTCAAAT 


TGTTCTTTGC 


TGATATTTTT 


CATCATAATT 


TCTGG CAAGA 


720 




TTGTAACACC 


TACACCACTA 


ATCAACATTT 


CTTTGCAAGT 


TGCTACTTGA 


T C CACTGTAA 


780 




TAGTTGCATG 


GTAATCTTGT 


TCTAAATTAT 


CGTTATACCA 


TTCTTTTATT 


TGATTTATAT 


840 




AAATCGGATC 


AGCTTGAAAC 


TCTATAAATG 


GTAACTTTGT 


AACATCATCT 


CGTCTATTTT 


900 


20 


TTGGAAAAAT 


AAAATAATGA 


TCATCATTAA 


ATAAATGTGT GTTAGCTAAA 


TTCATTACCT 


960 




TATTTCCACG 


AGTTATCATA 


ACATGATAAT 


CTCTATGATT 


TGCTTTAATT 


TGTTCAGTTG 


1020 


25 


AACCAACTTG 


CACTTGTATT 


TCAACATTAG 


GAAATTGGGC 


ATTATATAGG 


CTCAAAACTT 


1080 


CAGGAAGTAA 


GGTTTGTCCA 


ATCAAAGAAG 


AACACCCGAT 


TGATATTGTT 


CCATTCACTT 


1140 




CACCAATATG 


TGCCTGCATT 


TTGTCAAAAA 


ATAATCGCTC 


TCTTTTCAAC 


ATGTCACGAG 


1200 


30 


CATGCTCAAT 


AATCATTGTT 


CCTTCAGTTG 


TTGTAATCAA 


TTGTTTTTTT 


GTTCTGATAA 


1260 




AAATATCTAC 


TCCAAAAGCA 


TTTTCAATAG 


CTTTTAGTCT 


TTGTGTAACA 


GCAGGTTGAG 


1320 




ATATATATAA 


AATTTCAGCC 


GCTTTACGTA 


ACGTTTTCGT 


TTCGTCTAAT 


GTTATTAGTA 


1380 


35 


AACGATAGTC 


TTCAATCTTC 


ATAATTTCCC 


C C CAT AAATT ATTCAATTAT 


TGAACTTTCA 


1440 




TGGCTACAAG 


CATTCATGAG 


TTCATTACTA 


ACGAATAATT 


TCACCAATTT 


TATTGGTATG 


1500 




GCTGCAGCTT 


GAATTACTTA 


GTTTTT CTTT 


TGTTGTTGGT 


GATTTTTAGT 


TTGATTATAT 


1560 


40 


TGCTTAGGCT 


TTATTTGTTT 


GCTTTTTTCA 


ATATTAGTTT 


TATTTTGTGG 


CTTTTGATGA 


1620 




TTTTTTTGAG 


CCTTTGCATT 


AATTTTATTA 


AAGCAGTACA 


TGATTTTCTT 


TTGGAATCCT 


1680 


45 


TTAAAATCAT 


TTTCTAACTC 


TGCCATAATT 


TGATGTGCAA 


TCATATATGC 


TTCATGAAAT 


1740 




TGCTTTTTTG 


TAATTTGCTC 


ACTTTCTAAT 


GCAAACATTA 


AATCATCTTC 


ATCTACCAAC 


1800 




TCATaTcACC 


ACTTGG 










1816 




(2) INFORMATION FOR SEQ ID NO: 352: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 



GCGTTGTCGT CGATGATTAA 


TAAAGTATGG 


GTATACCATT AAGAATAACG CTACC CAAAT 




gAkTGCTAGT GACGTGCCGC 


CAATGACATC 


TGAAAAGTAA TGTGCATGAA AATAAAGGCG 




ACAAAATAAT ATGCTAAG CC 


ATAATATTCC 


CATAACCAAC GCACTCAACA CTTTTGTTAT 




TGTCTTAGCA GCAAGTGAAA 


TAATAATGAT 


CATTAAGGCG AAATATAATA ATGTGCTGGC 




GTTGGAATGT CCACTCGGAA 


ATGAAAAGCC 


TGTATCAACG GCTAAATGAT TATATGGTCT 




TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 




CACAAACCAA ACCGCTAAAT 


GCCTCTGTTT 


AATAAACAGT ATGATTGTGA CGATAATGGA 




AATCAACACG ACACCTTTGA 


CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 




CAAATTGTTA ACATACTGAC 


GTTGTGGCTC 


ACCGAAATAA TCTGTAAACC ATGTTAATGA 




TCCCATATCT ATATTTTTAA 


GCCATTCTTG 


ATTTGTCACT ACACTGTAAA ACATACCTAT 




AAATACAATC AGCGCGATTA 


AAAATAAAGG 


CACTGTcATT TTCGGTGATG TTAATTTTTT 




ATCTAT CAT C TTACAATCTC 


CTCGTATCAT 


CATTTTCATT TTACAAATGT TAT C CATAAT 




ATCAATGTGC CACAAATTTC 


ACTTTACCGA 


CAATATCAAA ATTATAAAGT TCATATTGTT 




ATGTATATTG CAAATAAAAC 


ATTGTATAAT 


TGAAATAACA ATATTTTGCT ATTTTCAATT 




TAGTACGATT TATATTTATT 


ATACAGAGGG 


GGTAAGGCGT ATCAATAGAG TTATTTTTGT 




CTATATAGCG TTAATCATTA 


CATTAGTTAG 


ATTCTTTACC CCTATTCATC CATCATTTTC 




AAATTTGATT TACTGGATAT 


TTGTATTATA 


TTTTATTCCT ATTATACTAT GCGTTATCGG 




TTTCAAGGCC GAAAAACTTA 


TTGCAACAAT 


GGT CATTATA CCTAATTTTT TAGGAATACT 




TTATCGATTA TATGCCTACG 


T CACACATAT 


ACTCTTTATG TAAAAGGATA GTGGGCATGT 




CTCGCAACAA ATATGCTTGC 


GTCGACTTGT 


CACCATTCGC AAATTTTATG ACATACGCCT 




TTCACGGGCT ATATTCAGAC 


CCACGCATTC 


ATCCACGTAA TAAACACATC ATGTAATAGA 




AAAACAGCAC ACCCAAATAT ATGGCGTTGC 


GCTGTTTAAC CAAGCATACT TCTATAGCTT 


1320 


TAATAAGCCA GCAGAAGCAT 


ACCTAACCTT 


CTTAAATATG CTTTTCCAAA TTATCCTCAA 


1380 


GTTT GAATAC GATAATACGT 


TCACCTGTAA 


CTGTACTTAA ATCACTATGG AAGCTCATCA 


1440 


CTTTGATACC TGTAATTTTA 


AAAATGATAT 


CATTCAAATC TTGCTCACCG GATTCAACTA 


1500 


ATT CAGAACG TGTTCGTTTA 


ATATTTAATA 


ATCCTTCATT CGTACTACAT ACACGATATT 


1560 


CAGCTGGCGT TAAGATACCT 


TGTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 


1620 
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TTTCAGCTTC GATTTCACCT TTCGTTCTTT TCATATCACT AACTCCAATA ATATTAAAAT 1740 

TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 18 00 

5 TTAGTATATT TTAAGAAGCT AGAACATTGT AGATATGATG ATATATTAGT TACTTAGCAT 1860 

CGCAACATAT CATCGTTAAA TCCAACTTTT AAAACGCCCT TCCTCATTAA CGCTCATTAA 1920 

ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCG CGA GTAGTCTTGC 1980 

w 

TACAACATAG TCGGGTGCCT GAATAACGAC AAG TAAACGA ATTGGCGAAT GATACATCGT 204 0 

CCGAT CAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC AT CAGAT CAC TCGCATTACC 2100 

15 TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160 

ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 222 0 

ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 2280 

20 TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTTAAT TGGCGTCGTC CAATTATAAA 2340 

TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 24 00 

CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCG TT GGCAGTTTGT CCAAACGTTC 24 6 0 

25 

GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATG CATCTAA 2520 

AGCAATAGAA GATAATGTGT CTGGCACATA TACCCATGCC AACGTAT CAG TAGACGTATG 2580 

ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC TGGAATATAC ACACCTGATT GTTTTAAT C C 2640 

30 

TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 2 700 

TGCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTGT GAGAATGACT 276 0 

35 AGCATGACCT GCTAACACAA CGAACGGCGC AAATG CTTCG GTTAAATCCA TCAATTTCAA 2820 

CGCTTGTAAC GCGAAATCAA TTTGCTCTTG CTCAGTAAAT CCAACAGGTA AGTCTGATGT 2880 

TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TTTTTCAACC ACTTTTGTTT 294 0 

40 TATTTTTTGT AAAGACGCGC GACTTTTTCT AGGCATAATC GAATTGACAA TGGTACTTAA 3 000 

GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT TATTTTTCAT 3 060 

CAATTTAAAT GTGTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 312 0 

45 

ATCG TAGCGG TCTGCAAATT CTTTAATGCG AT ATGCCGG C GGTACCATGA CAGGTAATGA 3180 

ATCATGTTTG AATTGTTCGT CTACGGCATC TTTTTGAATA GGTAATCCAA AGAAGCCTGC 324 0 

50 AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACG AAATG GTTCTGAACG 33 00 

AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3 360 

ATTATTTTCA TCAACTGCTT GTGTGTCATT TAACAATACT GAATGTGGCT GATTAGCGTT 34 20 
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TGCTTTAATT TTTTGTTTTA ACTGAGATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3 54 0 

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 3 600 

GACGTCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 3 6 60 

AGTTTGCTTA AACCAATTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 3 720 

ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG 3780 

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3 84 0 

CAAAAGATGT CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATGAGTTAA 3 900 

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3 960 

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAACCAT GCATGGTAAA AACTTTGcTC 4 020 

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4 080 

CATTTGACGA TTCACTTGTT CGCTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 414 0 

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCACGTTTTG ATTTTTCTAA 4 2 00 

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATG CAG GTACGTCTTT 4 260 

TAATGTTTTA ATGTTATCAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 4 320 

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 4380 

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 44 4 0 

CCATTTTGCG ACATCTTCAA ACGTATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4500 

AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC GCATTTTCAA TGACTGAATT 4 56 0 

GATATTTAAC TGTGTTG T CA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4 620 

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 46 80 

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 474 0 

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCGGC GTTGTCGCAT GTGTTGTTAT 4800 

ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC 4 860 

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4 920 

C CAAGATAC A AGTAATGACC ATCCTAGAAT GAG TGCACTT AACACTTCAT ATG CACTTCT 4 980 

GTCACTACTC ATCCAAAATA GAAATGCCAC G ATAATAG CT AATACACGTC CCATGACAAT 504 0 

CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCGCT TCACGATAGA 5100 

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160 

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220 
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AGAAATACTA GAAAGGATAA G TAATAATG A TAACGCAAAT CCATTATCAA ATATCGGCGC 534 0 

AAAACGAGTT AGAATAACAC CACCTGCATT CACAATTCCT GCATGCATAA TTGCCGATAC 54 00 

TGGCGTTGGT GCCGTTACAG ATTCAATCAA CCATCGATGA AAAGGAAATT GTGCTGCCGG 54 60 

TATCATGACA GCTAATACAA GTAGTACATT CGTCAACAAT GACCATGTCG GATGAACTAT 5520 

ATGTTGTGGT ACCCGCCACT CGCCAGTCGC AATATAAATA GTTACAATTG CTCCAACGAA 558 0 

TGCAAGCCAA CCACATAAAA ATGTCATGCT TGATAATTTC GCAGACTCAC GTGGCACTTT 5640 

CCAAAAACGA TTAACGTTCA TCAGCAATGT TAAACATAAT AATGTAATAC CCCAGCAGAG 5700 

TGCCATCAGT CTTAAGTCTT CAGACATCCA TGCTAAAGAT GCAAACGACG TAATCGCAGT 5760 

GAACAATGGA AAGTAATGTC TATAATGATG ATCACCTAGT AAATATCGCA TTGAAAACTT 582 0 

TTGAATAATA AAGCCAAGCG CCATTACAAA GCCAGCTAAT AACCAAGATA AACGATCTAT 58 80 

TTTAAATGGA CCTAAGACAT GTTGACCATG AATACCGAAA AAGCCAATGA CTGCAAATAA 594 0 

TACTGGCATG ACTAGTATGT ATAAATGTAA TTTAATATAT CTCATTGGCA TAACTGGTGC 6000 

TAAAAACAAC AAGCCACTTA TCAATGCAAT GATAAGCGCA ATAACAAACA GTGAAAATAG 6060 

CAATTGAAAA CTTAACACTG CATAACCTCC TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120 

G T AT AAAAAT AAAAACCTAC AATAGTAGAT TCTGTACATA ATGGCAGAAA ATTTACTATT 618 0 

GCAGGTTTCA GTTTAACTAG ACACTGCATC ACGGTACGTT GATATACCTT GTTGCAGTGT 6240 

TCTCTTTAAG CGTGCTCCCA TGCACATATG TATATAAAAT GTTACTTCTG TCTGTTCAAT 63 00 

TCATCTTCAT AAATATGCTT TGCCTAGACG AGACCTAACG TGTTATT CGT TTTAAACTTA 636 0 

TAACATAAAA TATAATTAAA TTTCTGCTTC ATGTCAAATT CATGAGCTTA ACCTCTATTA 642 0 

AACCAATGAT TGTAAAGATT TTGTAAATGC ACCTGTACAG TTAGGCAGTA TTTCCCGTCC 64 80 

TTTTAAAATA AAAAATTCGC AGTTATGATC ATAACAATTC AAGTTAGGAA AAAAATCAAT 654 0 

TACGCACAAG ATAACTATGT ACAATGAAGT TAACTCATAA GCAAAGGAGG TAATCTTAAT 66 00 

GGGTATCATC GCTGGCATCA TTAAAGTTAT CAAAAGCTTA ATCGAACAAT TCACTGGTAA 666 0 

ATAAGATTTC ATAACAAACA AAGGAGGTCT TTCACATGGG TATCATTGCA GGAATCATTA 672 0 

AATTCATTAA AGGATTAATT GAGAAATTCA CTGG TAAGTA AGTTATAAAA ATCTCATAGA 6 7 80 

TATGAACATC TTATTTGAAG GGGGCCATTC ACATGGAATT CGTAG CAAAA TTATTCAAAT 6 84 0 

TCTTTAAAGA TTTACTTGGT AAATTTTTAG GTAACAACTA ATCTCAAACA TTAACGATCA 6 900 

ACAACTCATC ACTATGTTAA ATCAACATAC AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960 

CTATCATTAA AATCATCAAA GCAATTATCG ACATTTTCGC AAAATAATTT AAGCGAATTG 7 02 0 
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TTATTGATGT 


GAGGTGAGTC 


TTGTTAGTTT 


GTTGCAAATA 


AATGGTCTTG 


GTG TTTTTTG 


7140 


TATAGGACGT 


TCTTAGTGGG 


ACATACGGAA TATTCGTGAT 


CTTTGTAGTC TGACG CGTTA 


7200 


TATTTTTGTG 


GCGTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT G ATTTGG CAT 


7260 


GCGAAATCTT 


AATGATTTTC 


ATGATCTAGC 


GCAAGATATA 


TTGGCCACGT 


GCGGAATTGC 


7320 


GTTGCACGTT 


TAGACTGAAA 


CACTCGTGTG 


ACCGTAAGTG 


TTAATAGTAC 


ATTGATAGCT 


7380 


GCATTTACTT 


CACTCATTTT 


TATGACTGTT 


AAACAATGAT 


TGTACCTTCA ATTAACAGTT 


7440 


GGTACGATGG 


TTTTGCCATT 


TTTCATCAAC 


GTAAATATAA 


AAAGGACTAA 


GACACATACA 


7500 


TGTCCT AG C C 


CTATGGATAA 


AATGCAAATT 


TCTGCTTTAT 


CAAAACTATC 


ACACTTTAGA 


7560 


TAGATTGAAA 


ACAAAAAGAT 


CCTAAGAACA 


CCTTAACTTT 


TTATTAATTG 


TCATAAATTG 


7620 


CAAACAATTA 


AGCCACAATT 


CAAAAATGAT 


TATACTTCAT 


TCAACTTATC 


GTGCTGGTCT 


7680 


AATTTGCCAT 


TGATATGGAT 


CTTCAAATTG 


TTGCCAATCT 


GCATCAATTT 


CTTGCGCATT 


7740 


GACTAAGCAT 


GCGTCGAGTT 


CTTTTGTTAA 


TTTTTCTTCA 


TCTAATTCTG 


TACCAATAAT 


7800 


GACAAATTGT 


GTATGACGAT 


CGCCATATTC 


TGGATCCCAT 


TCAGCTGCGA 


CATCTTGACG 


7860 


TTCTGCTAAT 


ATTTGTGTTT 


GTTGCGCTTC 


AGACATACTA 


GCCACCCAAT 


ATGTAACTGG 


7920 


ATGAATATTG 


CAAGATGACC 


CTGCTTGAGA 


TAATAAACAT 


GCTACGTGAT 


TGTATTGTGC 


7980 


TAGCCATACG 


ATACCTTTTG 


ATCGAACGAC 


ATTATTTGGC 


ATGCTTTCTA 


ACCAATCATT 


8040 


GAACCTTTTA 


GCATGGAAAG 


GTAGACGACG 


TTTATATACA 


AACGATGATA 


TACCATATTC 


8100 


TTCTGTTTCA 


GGTGTATGCG 


ATGCATGCCC ACCAGACTCA 


AGTTCTTTGA 


TCCATCCTGC 


8160 


TGACTCGCTC 


GCTTTTTCAA 


AATCAAAACG 


CTGCGTATTC 


AAGACTTCTT 


TTAAATCTAC 


8220 


TTCAGAATTT 


GTTGTCTTAA 


TAATTTTAGC 


AGTCGGTTGC 


AATGCGCTTA 


ACATTTTTT C 


8280 


TAACTTCGCT 


AGTTCTTCTT 


CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 


8340 


AAATTCAACT 


TGGTCAATTA 


ATAAATCAGC 


AATCGAACGC 


TCATCTGTTT 


CGTCAACGCT 


8400 


TTGATCACGA 


TCCATCAATA 


AATCTTCTGA 


GTTGATGTCA 


TGTACGAAGC 


GGTTAGCATC 


8460 


CACAACTGTA 


ACCATTGTAT 


CTAAACGGCA 


AATCGCTGTA 


AGATCAATGC 


CAAGTTCATC 


8520 


ATCAATATAT 


GAGAAAGTTT 


GTGCAACAGG 


TACTGGCTCT 


GAAATCCCTG 


TTGACTCAAT 


8580 


AACAATTTGA 


TCGATGCCAC 


CTTTTTTCAC 


TAAACGCTCA 


ACTTCTTTTA 


ATAAATCGTC 


8640 


TCTAAGTGTA 


CAACAGATAC 


AACCATTAGA 


AAGTTCGACT 


AATTTTTCAT 


CTGTACGCGA 


8700 


TAGTCCCCCA 


CCATCTGCGA 


CAAGATCTTT 


ATCGATATTT 


ACTTCACTCA 


TATCATTTAC 


8760 


AATTACCGCG 


ATACGTCGAC 


CTTCTCGATT 


TTGTAAAATA TGATTTAACA 


ACGTTGTCTT 


8820 
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ACTTCAATTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 8 94 0 

TTTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCGCTTTACT 9000 

TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 9060 

TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 912 0 

AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 9180 

GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTG CAG CGCTTTATGG 9240 

TTTGTAAGGT GAAGGTGATG AGTAACGGGT AGTTCGGGAG AGGTTAACTT G CGTTG ATTT 93 0 0 

TGATAAAGTG ATCAT AG CTT TTAGTACTTG AGGATTTTTA TTGTTGCTGT TACGAATGTG 9360 

GTCAT GTTT A ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 94 20 

TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 94 8 0 

GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 954 0 

CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACTTTTAACC 96 00 

CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg CtTCGCAATG TTGCGCCATT 966 0 

TGTTCAATTA ATACTTTTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 972 0 

TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 97 8 0 

AGATTAAGTG TTTCTACTGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 984 0 

ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA 990 0 

TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 9956 

(2) INFORMATION FOR SEQ ID NO: 3 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

TTTCTTTTAA CAGaTTTATC CCATTAATTG TTC TACAAGC CCACGaTGAG CAATATCATT 60 

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120 

AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT 180 

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC 24 0 

AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300 
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AACAACTTTA TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG 42 0 

AG ACCAAT CA TTATTTAATT CAGGCAGCAA CTTATCTACA CG CAT AC CTG TTTGTTCTTT 4 80 

ATCTGTAATG TTAAATTCAT AAGTCTC CAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 54 0 

GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 600 

ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 660 

CCTGTTAAAA TTCTATCAAT AAAGTTTCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 720 

ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 780 

ATAATAATGG TAATAATAAA GAAAAATGTC ATTTTTCCAC TCAATATTCC CCATGcAGCA 84 0 

CCATTATTTC GATGTGaTGT TATGTTTAAA AAGTGCGGTA TCACTTCAAA TGAATCTCCA 900 

ATTTTCATTG TAGTAGCTAT AATATATTTA GTAACTTGGT CAAATATAAC GACAAATACT 960 

GCTATTAAAA TGGAAGTGCC AATAAAATAT TTTTTGTGCA TTTTCGTTCC TCCAATCAAT 1020 

CGTCCATGAG ACAACTCTTT ATATTATAGC TTACACCTGC TAATAAAAAA AGT AAG CAT A 108 0 

TTACATTAAA TCTAATGTTA CTAACTCAAT ACTTGATAAA CTACTATGTT TTGACATTAA 114 0 

ATATGAACTT AATTATTCAT TTATCATATT TAAGATGACA TTAAAAATTA GGAAAGCAGG 12 00 

CTGGAACATA AATCCCTAAA AAGACAGTAG TAAGATATTT TCTAATTAAA AATTATCTTA 1260 

CTGCTGTTCT CTATTTATAC AATACTTCGT ATTGAATGGC TTCGCTATGC CCATCTGGCA 132 0 

CATTACTGTA AAATTCTATA AATAGAATTT TTGATGATGG GTCCCTTCCT AGGGTGCCGT 13 80 

CTCAGCCTCG GtCTTCGACT GGCACTGCTC CCTCAGGAGT CTCGCCATTA ATACTACGTA 144 0 

TTAACATGTA ATTTTACTTT TAAATACTTT AAAAAAATAA GACATGAATC GTCTACACTT 1500 

AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 1560 

TtCACTCGAA TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 1620 

ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 1680 

AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC TTTTTTACAT TGCCTAAGAA 174 0 

CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 1800 

TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 1860 

CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 1920 

TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 1980 

TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 2040 

GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 210 0 



1238 



EP 0 786 519 A2 

TTATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC GaAAAGCATA 222 0 

CGGCTATTTG AGAATATACC ACTGTTACAT GAT GAAGCAG CAGCGGTTAA TACTACAAAA 22 80 

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 234 0 

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 24 00 

TTnnATATTA A 2411 
(2) INFORMATION FOR SEQ ID NO: 3 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 60 

AGGTCAAAAT ATTTTTGATG CG CTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA 120 

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180 

GAT ATT AG TG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 24 0 

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGG AG CA GAATCTATTA ACGAAGTATT 3 00 

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 360 

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420 

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 4 80 

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 54 0 

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600 

TCCAG 605 
(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60 
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ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA 18 0 

TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 24 0 

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 3 00 

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360 

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 42 0 

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 480 

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAG CG A TATCATTCAC 54 0 

TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 600 

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 66 0 

GCAATGAC 66 8 

(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 787 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 56: 

ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60 

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 12 0 

CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180 

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 24 0 

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 3 00 

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 3 60 

TACCATACAC AATTG CAG T A AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 42 0 

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 4 80 

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 54 0 

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 6 00 

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 6 60 

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 72 0 

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 7 80 
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(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE-, nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGC CATT GCTTCTTCTG 6 0 

TGATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120 

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180 

AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 24 0 

TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300 

AGACTTTTAA TCCTTGTtTC ACTAATGATC GATTTTCATC AGTTAAGGAT ACTAAATCCG 360 

CAATGGTACC TATCGCAACT AATG CTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 4 20 

GTGCTAATTT GTATGCAACA CCTGCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 4 80 

GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534 



(2) INFORMATION FOR SEQ ID NO: 3 58: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

40 GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT - 60 

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120 

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTGAAT 180 

45 TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 24 0 

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300 

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360 

SO 

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTGCTTATT TGCGTGGCTT 4 20 

TTTGTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 4 80 



55 
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 600 

ACTTGAATAT GATAAAGTGA CATATTT AT G TATATGACTA TTTCGCAAAA TGTAATCGAG 66 0 

5 GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 72 0 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 7 80 

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 84 0 

w 

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 90 0 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 96 0 

)S ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 10 2 0 

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CAT CG AAGCG AAAGTAACAG 108 0 

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 114 0 

20 TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200 

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260 

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 13 2 0 

25 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 1380 

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 144 0 

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 150 0 

30 

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 156 0 

ACGAAAATGA TGT CATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 162 0 

3S AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTG 16 8 0 

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 174 0 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800 

40 TTGAAAGTGA TCCTTCTACG ACT AAGG CGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA 18 6 0 

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 192 0 

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980 

45 

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 204 0 

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAG CT ATTGTAGGTA 2160 

50 

GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 22 2 0 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280 

55 
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AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 
ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATGGTCGC TCAAATTTTA TACAAATCTA 
AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 
ATGATTTCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 
GTCTTGGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 
ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 
TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 
GAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTTA ATCGATACTG 
CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 
CTTTAAAAGC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 
TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 
TTGT CGTAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG AAATTTGAAG 
ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 
AAGAACGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 
AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGTTGTTAC TGATGCAATT TCCATGAACC 
CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTG CTATAG 
AACCACCGAC ATTTGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 
GCTATTTAGA GAATCAAATC CGTGCCGCTT TTGGTTTTGA AGkTACA CCA ATTCATATTA 
TAGCTCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 
ATGGGAAGTT TTGGGACAGC CCTTGCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 
ATGTGGGGTA AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC AAATAAAAAG 
TATTTAAAAT ACGCGAAATT A 
(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3621 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 
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AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 18 0 

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 24 0 

5 ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 3 00 

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 36 0 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 42 0 

10 

GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 4 80 

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 54 0 

1S GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG GOO 

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 64 3 
(2) INFORMATION FOR SEQ ID NO: 36 0: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 6 0 

30 

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 12 0 

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180 

3S AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 24 0 

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 3 00 

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360 

40 TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTT CATATAT TGAACATAAA ATGCAAGGTA 420 

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 4 80 

AAAAAGGACG CTATCGTCAA TTTAAT CAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 54 0 

45 CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 6 00 

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 6 60 

AAGCGTTAGT GA aACACTTT GAACCAGTAA TT CATGAATT TTGTTCAGAT TGTCAATCAC 72 0 

50 

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 7 80 

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 840 

55 
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GTGGATTGGA TTATTATACA CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG 96 0 

ATGGTGcCAT TACAACGCTT TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 10 20 

ATGGTCCAAG TGAAACAGGT ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 10 80 

TTGAAGAAGA AGGTATCGAA TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 114 0 

TGGGTG AT CA AGCAGATCGA TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 120 0 

TTAAAG CAG A TAAAGACTAT TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 12 6 0 

GTTTAGGTGC CAAGTTTACA ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 1320 

ATGTTAAAAA TATGACAACT GGTGAATCTG AAACAATTGA ATTAGACGCA TTAGTCGAAT 13 80 

ATTTTAAGAA GTAGAGAGGG CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 144 0 

TTACTGAGGC ATTTTTAGGA CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 1500 

ACCTTGGTGG AT t G ATTTt C GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 156 0 

TTAATCCTGC ATTTTCAGAA GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 16 2 0 

TTGTAGAAGT TCAAGGTACA GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 16 80 

AAACTGGCCA AGTTGAAGTA CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 174 0 

CACCATTTTC TATAAATGAA GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 18 00 

GTTATTTAGA TTTACGTCGT CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 186 0 

CACGTTCTAT TCGTCAATAT TTGGATGATG AAGGGTT CTT TGACATCGAA ACACCAGTAC 1920 

TAACGAAGTC AACACCTGAG GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 1980 

GTGAATTTTA TGCATTACCA CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 204 0 

GATTTGACAA ATACTACCAA AT CGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 2100 

GTCAACCTGA ATTTACACAA GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 2160 

TGCAAATGGG TGAAGAAATG CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 2220 

ATGGCGCTTT CCCACGCATG ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 228 0 

CAGATACACG TTTTGAAATG GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 2 34 0 

TTAAAGTATT TAAAGATACT GTTGAAAATG ATGGTGAAAT TAAAG CAATT GTCGCTAAAG 24 0 0 

GTG CAGCTGa ACAATATACT CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 24 6 0 

ymTGGCGCTA Ag G t TAG CGT GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 252 0 

ATGG 2524 
(2) INFORMATION FOR SEQ ID NO: 361: 
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(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 361: 

TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 6 0 

GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTTAAAGGA TCCAGTAAAA 12 0 

TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 18 0 

TTTTTTGATA TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 24 0 

GTTTATTTG C ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 300 

TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 360 

ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 420 

CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 480 

CTATTCAATA GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGT CAGGT 54 0 

ATTCAATGGG CAGACAATG C GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 600 

AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 660 

TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTG AG TGT A 720 

GCGTCTGGAG GCAGCGGATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 78 0 

ATTGCAAAAA CGATTAAAGC AAATGGGAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 84 0 

GCAGGATATA CGTGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 90 0 

TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 96 0 

CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 102 0 

GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 1080 

AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 114 0 

CAAATTAGTG TTAGG CATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 1200 

CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 1260 

TAAAAATCTA GAGGATTTAT A CGATAAG GA AGGCTATCGT GATGATCAGT TTGATAAAAA 1320 

TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 1380 

GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 14 4 0 

mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 150 0 
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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



w 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 





TACATGTTTC 


GGATGCTACT 


TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 


60 




AAGTTTATCA 


AGAATGGTAT 


TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 


120 




CACGAACTAC 


TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTTTGAGA CCAGTAGATG 


180 




AAGTTAAGGT 


ACTTCTGAAT 


GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 


240 


20 


GTCGTCCTTA 


TACTGAGACT 


GTTGTGCCAT TTGAAAATTT AGGATTG TT A C CATATTTTG 


300 




AAG CTGATTT 


TATTGCAACA 


GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 


360 




CACGACCATT 


AGGAAAGCCG 


AATCCTTTTA GTTATATCGC AG CTTT ATAT GGTAATAATC 


420 


25 


GCGATAAATA 


TGAATCTTAT 


AT CAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 


480 




TTATAGTAGG 


CGATTCGTTA 


GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 


540 


30 


TTGGAACATT 


AACAGGTTTA 


AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 


600 


CCGACTATGT 


TATTAATCAT 


TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAATTTG 


660 




ATTGTTGTTT 


GACAGCATAA 


CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAGTT 


720 


35 


ACAATGAAAT 


TAATAATGAA 


AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 


780 




GGCATTTTGA 


ATTTACATAA 


ATTGTTTATG ACTGTACCGT CAATTCAGTT GTGAAAATTT 


840 




GATTGTATTC 


ACCAACTTGT 


TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 


900 


40 


CGGATAAAAT 


CATTAGCGCA 


TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 


960 




TGTTGATATT 


GTATTTACCT 


AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 


1020 




TATGATATGA 


CACAATTTGA 


TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 


1080 


45 


TAACAATTCT 


TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 


1140 




CGnTTGATAG 


TTCTACCTCT 


AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 


1200 


50 


TATTTAATGT 


CACTTG 




1216 




(2) INFORMATION FOR S 


EQ ID NO: 363: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 

55 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3G3: 
AT CAAAAT AT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 6 0 

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 12 0 

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180 

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 24 0 

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAAT CTTG 300 

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 3 60 

TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 420 

AATTTTGTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA CGTAATCATT 4 80 

TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 54 0 

TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600 

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 6 60 

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720 

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 7 80 

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 84 0 

AAGGTGAATT AGGGGTCATC TAACATT 867 
(2) INFORMATION FOR SEQ ID NO: 3 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 6 0 

ACATCCCTGT ATCa CTATG A CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120 

AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180 

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 240 

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 3 00 
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CAATCGCATG CCG CAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 420 

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATG CTAA AAATTGGTAT 4 BO 

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 54 0 

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600 

AG T ATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 72 0 

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 78 0 

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 84 0 

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 90 0 

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960 

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 1020 

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 1080 

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCGAATGTGA AcATCATTTA TTATTACTTT 114 0 

CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200 

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 126 0 

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320 

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380 

ACTTT TTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 1440 

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560 

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620 

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATG CTCTTT AATTCCATCT 1680 

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 1740 

AACAT TATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800 

GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CG CTAGTGG a ATAATTAATA 1860 

ATAATTGTAT TTAAATG CAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920 

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980 

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 2040 

TGAAGAATCT AAGACTGTTA AAATCGAGAC AAGAAGAAAC T CG TTCCAAA TTTGAATCCT 2100 
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AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATGCGTG TGATAGAAGT TTTTATTTTT 2220 

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 22 80 

5 

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 2 340 

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 2460 

10 

ACACAAGCGC GCGACATGCT GCAAAGTTTT GCGCTAAACC AACGGCAGCA ACTACATGAC 2520 

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCT CTAATGA AGCTTTAGCA ATTGGTAATA 25 8 0 

,5 CTTTTGTACC ACCGCCAACG ATTGCCAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640 

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2700 

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760 

20 CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2 820 

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 28 80 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTGG 2 94 0 

25 ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 3 060 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120 

30 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 318 0 

CTTTAATGTC TGCTGATAAT TTTT CAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 324 0 

3s TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3300 

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360 

TATCGTCCAC AATGATATTC GGTAATAATC CAACGGGTAA TGCACCTTGC GCGATGACAT 3420 

40 TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 34 80 

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3540 

AATGTCGAAA ATTCTTAT CT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3600 

45 TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3660 

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACG CTTACAAGAA 3720 

ACGACAAATC ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAG CAT ATTCTGATAT 3780 

SO 

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3840 

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3 900 
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CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4 02 0 

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4 08 0 

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 414 0 

AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAG CTAAA GATTATTTAG 4200 

CAACTAGAC C GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 426 0 

ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGGTTATT GCACATAATC 4 320 

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 43 BO 

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 444 0 

TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4 50 0 

ACTTCGCATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAAG GCATTAGAGT 4 560 

CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCGTTT ACGTTCAGGA TATGAAGATG 4 62 0 

CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 4 680 

TATCATTACT TGAAAATCGA GATTTACAAG CTGGTGAAAC AATCGGTTTA TTCAGTTATG 4 74 0 

GCTCAGGTTC AGTTGGTGAA TTTTATAGTG CGACATTAGT TGAAGGCTAC AAAGATCATT 4 800 

TAGATCAAGC TGCACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4 86 0 

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4 92 0 

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4 98 0 

GACCAG AG TA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 504 0 

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 510 0 

ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATG CCTGGAT CCTTTATACG 516 0 

TTACGGCTTC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 522 0 

TAACAG CTT A ATTTTATTAT TAATCGTTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 52 8 0 

GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 534 0 

CGCTTGGGCT GACATTTTTG GCTTGTTAAG CAGCTTGCCT ACTTTTTTGG CAATAGCACC 54 00 

ATACGTTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 5460 

ATGACTACCT GTTGG CTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 5520 

GTCTAAC CAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 5580 

ATCACCTTGT TGATTTTCAA ACAAAA CAG C GGTCAGACTT ACCCCATCAC TCAAAAGTTC 564 0 

CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5 700 
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ATAAGTCGTC AATTACGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 5820 

AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 5860 

5 

ATATTGAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 5940 

AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 60 0 0 

AAGTACAATT GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 6060 

10 

TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAG CGACTTT 6120 

GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 6180 

;s AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 624 0 

GGTGGCAATC ACTCTGCTGA ACAAAGTGCG CAAgcTTTTC AACAAGCAGC ACAAAGACAA 6300 

CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTCGTAA CCTTACG CAA 6360 

20 GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 6420 

GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 64 80 

GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCG TTGAAGGAAA TGTACCAGCA 654 0 

25 GCAATCAAAG ACAAAGAAAT TATTTCTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 6600 

TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG AAGGTGTTAA ATCTTCACAA 6660 

AATGCCGTAC TATTCTTTGA TG AAATC CAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 6720 

30 

GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 6 7 80 

TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 684 0 

TTAACGCGCA GATTTAATGA AGTGCTTGTT AATGAACCAA GCGCTAAAGA TACTGTTGAA 6 900 

35 

ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 6 96 0 

GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 702 0 

40 AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 7 080 

GATAAAGTTG AAACTGAAAA ACGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 714 0 

AGTGCTGAAG AATATAAAAA AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 72 00 

45 AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTG CTA CAGTTCATGA TATTTCAGAT 7260 

ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 7 320 

TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 73 80 

50 ATGGTTTCAC GTG CAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 7440 

GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 7 500 
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GACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620 

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 76 80 

5 

GAAATCGAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 774 0 

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7 BOO 

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7 86 0 

10 

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT 7 920 

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 79 8 0 

15 TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG GTTAATTGAA 804 0 

GAAGGCTATG ATGAAGAATT AGGTGCACGT CCATTAAGAC GTATTGTTGA ACAG CAAGTA 8100 

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160 

20 GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8 22 0 

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTTT TTAATTATTT CCCTGGAAAA B280 

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 8 34 0 

25 TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 84 00 

GCTAAAAAAA TTAAAATGTT AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 84 60 

TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT 8 52 0 

30 

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 8 58 0 

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGCGG TGTGAATTGA 864 0 

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700 

ACGATAACTT CTTTTGCTAA AAATCC TGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8 76 0 

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 8 820 

40 TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8 88 0 

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 8 94 0 

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 90 0 0 

45 TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 90 6 0 

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 912 0 

AACAACGCAT ACACTGGCAG TCTAGCAGAG CATGACATGA ATGGTGCAAT TAATATCGTT 918 0 

50 GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 924 0 

CCAAATCCGA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTC CATAATA 9 30 0 
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AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 94 20 

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 94 BO 

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 954 0 

ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT 960 0 

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT AT CTTCAT CT 9660 

GGATACGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 9720 

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GAT CACGTAA ACTCGACAAT 97 80 

TTATTTAGAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 984 0 

ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 9900 

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 9 96 0 

CCTTCCCCTA GATGCTTTAT TTCG CCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10020 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 100BO 

CCGCGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 1014 0 

AACAATTGTA CTG TTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200 

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAG TTTCATC TTTAGAAATC 102 60 

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 1032 0 

CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 103 BO 

AAAG CGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440 

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 10 500 

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 10560 

TTATG AT AG C GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 10620 

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 106 80 

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 10 740 

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTCATT TTTCATGTAA AATTTGTGAC 10 800 

ATTGCAAAAA TGT 10B13 
(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 8 04 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 





TGATGGATTA 


GCAGACATTT 


TACGAGCGAA TGGTTTCAAA GTGTTTGGTC CAAATAAGCA 


60 


s 


AGCAGCTCAA 


ATCGAAGGCT 


CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 


120 




TCCAACTGCT 


GATTATAAAG 


AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 


180 




CTGTGAATTG 


CCCGTTGTTG 


TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 


240 


10 


TGCAGATACT 


ATTGAAGCAG 


CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 


300 




AGGTACTGTT 


GTATTTGAAA 


CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 


360 




TAATGGTGAT 


TTAGCAGTAC 


CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 


420 




TCATGATGAA 


GGACCAAATA 


CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 


480 




TGACGATGTT 


TTAAmACTTA 


CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 


540 


20 


TAATGAAGGT 


TATCAATTCT 


TCGGTGTATT ATACATTGGT G CTATTTT AA CTAAAGATGG 


600 




TCCAAAAGTA 


ATAGAATTTA 


ATGCCCGTTT TGGTGATCCT GAAGCTCAAG TATTATTAAG 


660 




TCGCATGGAA 


AGTGATTTAA 


TGCAGCATAT TATTGATTTA GATGAAGGAA AACGTACTGA 


720 


25 


ATTCAAATGG 


AAAAATGAAT 


CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 


7B0 




TGCATATGAA 


AAAGGG C ATA 


AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 


840 




TGGATTAAAG 


AAGCAAGGTG 


ATAC CTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 


900 


30 


CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 


960 




ACAAAGTGAC 


CATTTATTCT 


ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 


1020 


35 


AGTAAATTTA 


AAATACTAAG 


aTTAGCTATG AACGAATCTA TAACGATAGA TTTTTT CAT A 


1080 


GCTTTTTTAG 


TTGTAGAGTC 


TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 


1140 




TGTAATACAA 


AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 


1200 




CTCTTTTGTT 


TAGTATGGAA 


TAAATGGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 


1260 


GTATCTATAC 


AATTATGGAT 


AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 


1320 




CCGTGATGGG 


TAAGGTGATT 


GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 


1380 


45 


TAATAAAGAT 


TATATCTTTA 


TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 


1440 




CTCTAAATCC 


TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 


1500 




TTAATAGAGG 


TATAATTAAA 


TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 


1560 


SO 


TCATCTGATA 


GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 


1620 




GTATCATACG 


AATGGCAGCC 


ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 


1680 
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ATGAAATAGC AATGGTTCTT AATGATACAT GTAAACCACG AACTAAACTT TCTGTTGTAA 18 00 

TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 186 0 

5 GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 192 0 

GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 19 8 0 

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 204 0 

w 

TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 2100 

CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160 

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG 2220 

IS 

AAAAGTTTGA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 22 80 

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAg CTACGC TAAGGCGTCG TTTTTGACCA 234 0 

20 ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 24 0 0 

TGTATCGTTT CATCATCACT TTG AT CTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 24 60 

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2 520 

25 TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580 

TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2640 

ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2 700 

30 CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 2760 

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2820 

TGAACATACT TCTGTAATAT GATTTCAGGA CATTCAT CGG CAATGATATT TCCGTTATAA 2 8 80 

35 

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 294 0 

ACAAC CGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3 000 

^ ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 3 060 

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120 

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACG CAT CCAT GTCTTCACGT 3180 

4S GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTTTTGG CATACAAAAC 3240 

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 3300 

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3360 

SO TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 3420 

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 34 80 
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ATTTTGTTAC GCCTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACGCCGGCGA 3 GOO 

CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3 66 0 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTGCTGTA CAAAAACCTG 3 72 0 

CTAACATAGC TACCATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3 780 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 3 840 

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3 90 0 

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3 96 0 

GGCCTGTAGC TTGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 4 02 0 

CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4 08 0 

CATCCTCCTA ATAAAAAAAC G CACAAC CAT CCATAGGAAA GTTATGCGTT CACAATATAT 414 0 

ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4 2 00 

AAGGGTTTGA GGGCAAGCCT CATCT CAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4 26 0 

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTGTCTT CATGAAATTT 4 320 

CGTTTAATTC GATTTAAAAT TTATAATATT AGCATTGGAT TTAAATTGAA GATGTAGTAG 4 3 80 

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 44 4 0 

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 450 0 

CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4 56 0 

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4 62 0 

AAATCCCCTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATTTAGTGT AT CTTGGATA 4 680 

TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 474 0 

TCAACTTCTT TAGCTTTTTC TGCTGCTTTT TTATTTTGAT TTTCATTAGA CATGATTAAT 4 800 

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4 8 60 

TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4 920 

TATAATGATT TTGAGGTGAA CATAATGTCT TTTCTTAGGA AACACGCCGA AATTATTTTT 4 9 80 

AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 5040 

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 510 0 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 516 0 

CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGTCATA 52 2 0 

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 52 80 
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TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 54 00 

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 5460 

TTACACAAAT TGCATTGTAT GGCAGAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 5520 

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 5 580 

TCTTGTTCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 564 0 

AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 570 0 

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 576 0 

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 582 0 

ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 5880 

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 5 94 0 

TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 6 000 

TAATTCTGTT ATTCATTATT ATCG CATATA TCGCTACAAC GTTAACTTTC CACTTGAAAG 606 0 

GTAAGTTTTA TTATTTACAA TTAT TTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 612 0 

AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 6180 

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CG AC CAT AAT 624 0 

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 63 00 

ATATCTCTAA CTTACTTAGA TACAGG TCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 6360 

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTTTTA 6420 

GTG CAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 64 80 

TGAAGCGACT TATGGaATTT G CG AAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 6 54 0 

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 660 0 

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 666 0 

CAGATGAAAA TAG CAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 6 72 0 

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6780 

ATTGTGACTG ATAAATCACA ATAT 6804 
(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE ; nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

aaaAGAGACG CATTaAAACA AG CT ATACAA ATTATCGATA AATTAACATG GGGTGTTTAG 60 

TGGTGGTTAA AGAAATTTTG AGACTATTAT TCTTACTAGC GATGTATGAG CTAGGTAAGT 12 0 

ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATGTAGAG GCGCCAAGTG 18 0 

ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAG CT ATTATCATTT TTGAATTAAT 24 0 

TATATTAATG TGTGTAGCAA TAG CACTGGA GGTGTTGTAA ATATGTGGAT TGTCATTTCA 3 00 

ATTGTTTTAT CTATAT nTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 3 60 

ATAG AAG CAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 4 20 

GTTGAAGGTT TAGAAGATTA TGAAAATGAA GTTGAACGAA TTAGAAAAAG ATTCAAAAGC 4 80 

TAAAGAGAGG CGTTGGCTTC TCTGCTCTAT CyAAAATAAT GAAAGG AG C C saACATGTTA 540 

GaCmAAGtCA CTCAAATAGA AACAATTAAA TATGATCGTG ATGT CTCATA TTCTTATGCT 600 

GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG CTTGGT CTGA CTTTATGCAG 660 

AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 720 

TCTGAACAAG C CG AT AT AAA AGATGTTGGC GGATTTGTCG GTGGATATTT AAAAGAAGGC 7 80 

AAACGGCGTG CTGGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 84 0 

GCCCAAGATA TGACTGACAT ATTATCTATG TTTTATGATT TTGCATATTG TTTATATTCA 900 

ACACATAAGC ATAGAGAGAT AAG TCCAAG A CTGCGTTTAG TGATTCCTTT AAAACGAAAT 960 

GTAAATGCAG ATGAGTATGA AG CTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 1020 

TACTTCGATG ATACAA CTTA TCAACCACAT AGGTTAATGT ATTGG C CTTC AACTAGTAAC 10 80 

GATGCGGAAT TTTTCTTTAC CTATGAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 1140 

AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 12 00 

AAGACTAAAA GATTAGCAGA TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 12 60 

GCATTTTGTA GAGC CTAT AC GATAGAAGAA G CT ATAG AAA CTTTTATTCC TGATTTATAC 13 20 

GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 13 80 

TTATACGAAA ATAACAAGTT TGCCTATTCT CATCATAATA CGGATCCCGT AAGCGGTATG 14 40 

CTTG TGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTGCTCAAGA TGAAGAAACT 1500 

AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCG CAA 1560 

AATGATGAGG TTGTTAAAAA GCAATTAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 1620 

TTCGATGAAn GAGAAAATAG CGATGATG CA TGGTCTGAGA CG TTnGAAAT TACTTCGAAA 16 8 0 
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(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH -. 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 





AAGATTAGGC 


ATCAAGACAG 


GGTCGCGATT 


GTTTGAAATC 


CCACATAGnA 


ATGATATTTa 


60 


IS 


CaTTATCAAT 


CCaAGTATGC 


GTAAATATCT 


TAATGTTTCA 


GTTGCTATTT 


CTAAGATTGC 


120 




ATTGCGTTAT 


ATTCCACCTG 


AAGATTTACA 


CCAATATAGT 


ATTGACGAAT 


TTTTTATGGA 


180 




TGTTACTGAT 


AG CTATCATA 


GATTTAGTTC 


TACAGTACAT 


GCATTTTGCG 


AAAGACTTAA 


240 




ACGTGAAATT 


TATGAAGAAA 


CAGGCATTTA 


TTGTACTGTG 


GGCATTGGTT 


CTAATATGTT 


300 




ATTAAGTAAA 


ATTGCTATGG 


ATGTTGAAGC 


GAacATAGTC 


AAAATGGTAT 


AG CTGAATGG 


360 




CG ATATCaAG 


ATGTACCAAC 


GAAATTATGG 


CCAATTCmGC 


CCCTGCGAGA 


TTTTTGGGGT 


420 


25 


ATT AAT CGTC 


GAACAGAAGC 


CAAATTGAAT 


AAAAGAGGAA 


TTTTTACTAT 


AGGAGATTTA 


480 




GCGAAATATC 


CATATAAATT 


TTTAAAAAAA 


GAGTTCGGTA 


TTTTAGGTGT 


TGATATGCAT 


54 0 




CTACATGCGA 


ATGGGATAGA 


TCAGAGTAAA 


GTACGTGAAA 


AGCACAAGAT 


CAGCAATCCA 


600 


30 


T CGATATG CA AAAGTCAAAT 


ATTAATGAGA 


GATTATCATT 


TTGATGAAGC 


AAAAGTAGTA 


660 




ATGCAAGAGT 


TAATTGAAGA 


TGTTGCTAGC 


AGAGTTCGAG 


CAAGAAAAAA 


AGTGGCAAGA 


720 




ACGATACATT 


TTGCCTTTGG 


CTATAGTGAT 


GAAGGCGGTG 


TACATAAGCA 


ATATACTTTG 




35 


AAAGATCCAA 


CAAACTTAGA 


AAAAGATATT 


TATAAAGTAG 


TAATGCATTT 


CGCAGATAAA 


840 




TTATGTAATA AACAAGCACT 


ATATCGTACG 


CTAAGTATAT 


CTTTGAGTCA 


ATTTATTAAT 


900 




GAGGATGAGC GACgTTAAGT 


CTGTTTGAAG 


ATGAATACCA 


ACGCAAACGT 


GACGAATGTC 


960 




TAGCTAAAAC 


GATAGACCAA 


TTACATTTGA 


AATACGGCAA 


AGGTATTGTG 


TCCAAAGCAG 


1020 




TATCGTTTAC 


AGAAGCAGGT 


ACAAAACACG 


GCAGATTAGG 


TTTAATGGCT 


GGACATAAAA 


1080 


45 


TGTAATGACT 


ATACGGTTTA 


AGTAATATAT 


AACTGTGATT 


CGTATAAAAT 


AAGTCTCTAA 


1140 




AGATAAATAT 


TTCATATATC 


ACAATAGATT 


TTCACAATAA 


TATCTAAGAA 


TACATGGAAT 


1200 




TTATCAAAAG 


AGACTTAATA 


ATTATTGGAT 


ATAACAATCA 


AAATCACTCA 


ATGCTTGCAT 


1260 


SO 


ACCGCGTTCT 


CGGTCAGTAG 


GGTTTTTGAA ACTAATTTTT 


AAAGCACCGT 


ATATATCTTC 


1320 




GCGTACTTCT 


AAGATTCTTA 


AGTTGCTTAT 


AGATATGTTA 


TGTAAACTCA 


GGATATAAGT 


1380 
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TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTC TCTAAATCTT GAAA CTGACT 1560 

TTTTAGCTGT CGAATCATTT CTAAAATATA C GTTTTA TT A CTCAAGGTGA TATCTTTCCA 162 0 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 16 80 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 174 0 

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 1800 

AATAAATTTA GCAAGAGTAG GTGATAACAG TT C TTTTAAC GTGTTTG 184 7 
(2) INFORMATION FOR SEQ ID NO: 36 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 6 0 

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 12 0 

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180 

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 24 0 

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300 

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 36 0 

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 4 20 

GTGTAC CATT TTCCATCCTC TTCAATGACA ATGTTGC CAA TATCAAATTC AAGGACCTTG 4 80 

CCGTCCTCTG AAGT 4 94 
(2) INFORMATION FOR SEQ ID NO : 3 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 
GTC CATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTT a AATCTACTTC 6 0 
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GATAATTTCn AATGTAGCGA CCATCATTAC GAGATGATTT GATAT AAG CA CAGTT TGGAT 180 

GTTGACCAAT ACTATCGCCT TCTT CTTCGA TGATATCTAT TTTAATACCA TCATCAGCTG 24 0 

CAATTTCTAA TGAAGATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCCA 300 

CAATAGCG A C ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 360 

TTTCAATATT TTTAGGAATA TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 4 20 

CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 4 80 

TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT T CTTTT TGCT 54 0 

CGGCATTTTA AAGTTGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 600 

GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 66 0 

ATTAAAGCCA AACAATACGA TGGCGCCTGC AATTGGTGTT GCCATACCTT TGACAC CTAT 720 

TACTAGTCCG CTAAATGTCA CGATA CATGC GTTGACGACC CCAATCAGTG CATTTGTACC 7 80 

ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTCAATGGCT CAATACAAAC 84 0 

TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 9 00 

TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 960 

ACTTGTTAAT ACCATTGAAC TTAG CGGTGT CATACCTGTA ACAGGAATCA CTAGTC CTAA 1020 

AATGACCGCT AATG CATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TAC CTATTTG 10 8 0 

TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATGCA AGTCCGAAAG CAATTGCTGG 114 0 

TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 12 00 

AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTGGT AATAATTTAA AGTCATGTAA 1260 

TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 132 0 

ACCTACTGcG ATACCGCTTA AACTTCCTGC TAAATCCCCA ATATCTTGTA GAAATTTAAT 13 80 

ATCAAATACG CCACCAATAG CATAACTTAA GAATGCTTGT GGTAGAAATG TCGCACAAGC 144 0 

TGCAC CGGAT AATGCTTGTA GTCCTTGTTT AC CGTACGGT GCATACTTTA AAAATAGCGT 1500 

CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCCA TTTCCCAAAA 1560 

ACCTCTCTCT ATGTTTATTT TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 1620 

TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTG AAATATTTCA TATTTGTGAA 16 80 

AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 174 0 

ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 1800 

GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTAT CTT TCGAGCTATT 1860 
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ACACATATAT TTGCAATAAG ATAATTAAAG TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980 

ATGATGATAG TCTATAGGTT GAATCTTTAC T T T TTT TTAA AGCTAAATTT ACATCAACTT 2 04 0 

AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAGTAT AACTTAGTGG AGGCGATAAA 2100 

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 2160 

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 22 20 

ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACGCAAA AACATTTGAG 22 80 

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 234 0 

CGATT CATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 24 00 

TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 24 60 

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATAC TTGGC GAATaATCaT TCAGCTTT 2 518 
(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

30 ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 6 0 

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120 

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCACAGTACT ATTTTTAATA 180 

35 

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 24 0 

GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 3 00 

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 3 60 

40 

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 42 0 

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 48 0 

4S AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 54 0 

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT AGAATCATAT 6 00 

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATG CAGAAAT GAG C CTAGTG TACGATCAAT 6 60 

50 TGACTCAAGA T CAAC CAT AT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 72 0 

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 78 0 

55 
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(2) INFORMATION FOR SEQ ID NO: 371: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTAC CTG GACGCTTGAA ATGGTTAGTT 6 0 

GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 120 

IS 

TTAAAAGATC G TAAATT ATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 18 0 

TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 24 0 

zo AGCCT CATAT GTAAATGTAT GAGGGCT CTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 3 00 

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 360 

AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 4 20 

25 GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 4 80 

ATACA CATGG TGT CATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATG CAC 54 0 

TTCAAAATAG TAATGATTAT t TAGAAATTA TGGAAGAAGG AATATCATAT ATAG TACGTG 600 

30 AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 66 0 

TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 720 

CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 780 

35 

ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATGGATAT AATGTTAGTA 84 0 

GTGAA TGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCAC CAGAA ATTATGGAAA 900 

AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATT AACT AATAAAAATG 960 

40 

TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTTTCATT 1020 

ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTG 1080 

4S CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 114 0 

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 1200 

TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 126 0 

50 ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 13 20 

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 13 80 

55 
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CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAGGTGAA AGTGGGCTCA 150 0 

TTGCTCTTGA TTGG CATAAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT 1560 
5 GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TTATAGAGCm TATTTaGaAG 162 0 

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 
aAAAGGtATT TGCaTGTGGc gGTAtACCtA AAAAGAATGC TGTTATGATG GATAT CTATG 174 0 

10 

CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCAC CA GCAATAGGCG 180 0 

CAG CAATATT AGGTGCAGTC AGT 182 3 

(2) INFORMATION FOR SEQ ID NO : 3 72: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 





ACGATCATCT 


GCATCAGCGA 


ATTCCGATGc 


AaTTcATATG 


t CCt AATAAC 


AAAAGAATAG 


60 




GTTTT aAAAG 


ATATGCATAC 


CGTAATGaTA 


GATATAGTTT 


TAAACGTGAC 


TTCAAGCTAT 


120 




ATGaATGTGA 


TGACTGTTCA 


TCATGTTCTT 


TGAGACATCA 


ATGCATGAAG 


CCAAATTCGA 


180 


30 


AATCCAATAA 


GAAAATTATG 


AAGAATTATA 


ATTGGGAATA 


CTTTAAAGCC 


CAAATTAATC 


240 




AAAAGCTTTC 


TGAAC CAGAA 


ACGAAAAAAT 


CTATAGTCAA 


AGAAAAATTG 


ATGTAGAGCC 


300 




TGTTTTTGGA 


TTTATGAAGG 


CTATTTTGGG 


TtTCACTCGA 


ATGTCAGTTC 


GGGAATAAAT 


360 


35 


AAAGTTAAAC 


GAGAGCTAGG 


TTTTGTATTA 


ATGGCACTTA 


ATATAAGGAA AATAgcaGCT 


420 




CAACGAGCTG 


TACATTATAA 


AATACATATC 


AAAAAAGCTG 


ATTTCTATCA 


AATAAATAAT 


480 


40 


AGAAATCAGC 


TTTTTTACAT 


TGCCTAAGAA 


CTTTAAGGAA 


CTTAATGTCC 


CAAGCTCTTT 


540 


TTTGTTATAT 


CTAATTCGTA 


ATTTATGATT 


GTTTATTCGG 


TCCTTTGATG 


TTCACTAAAT 


600 




GTGACTTTAA 


ATCTTGTTCT 


AATTGTTGTA 


ATTCTTTTTC 


AGCTAATTGT 


CGTTCTTCGC 


660 


45 


GACCGTGTTG 


TTGAATAATT 


AATGTTTCTT 


CAATTGTCTC 


AATAATGTTA 


CGCTGTGTAC 


720 




GTTTCAATGT 


ATCAAGATCA 


ACAATGCCAC 


GCTCATTTTC 


TGTTGCAGTT 


TCAATCGCAT 


780 




TTTGTTTCAA 


CA1TTCAGCA 


TTTGCTGTTA 


ATAAATCATT 


AGTTGT AT CA 


GTGACAGCTC 


840 


SO 


GTTGTGCAGC 


AACTGCATTA 


CGCTGTCTCA 


TTAATGTAAG 


CGCAATGGCC 


ATTTGATTTT 


900 




TCCATAGTGG 


AATACTTGTC 


AAAATTGAAC 


TTTGTATCTT 


CTCGGCAAGT 


GCTTGATTAA 


960 



55 
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AGT CAT AT AT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080 

TAT CCATTTG ATTAGTGGAt TgCGCGCTTG CTGTTG CAAT TGTGGTAGCT TTTCATTTTC 114 0 

5 TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 12 0 0 

TTTGTTTTTA T CAT AT AG CG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 126 0 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 

w 

GATTTCATTG ATAGACGACT TTGCT CTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 138 0 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGAT CTG ACAAAGTATC 144 0 

TCCAATAGGA CCAACATCTT TA CTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 150 0 

)5 

CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 156 0 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 16 00 
20 (2) INFORMATION FOR SEQ ID NO: 373: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (d) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

30 ATGTTGATAA TGGTAGCTTC TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 60 

aTAGTAAAAC TGGTGGTTTA GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 12 0 

ACAGAAACCA AG CAAC AG AT CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 18 0 

35 

ATGGAC CTG C CATTGAAAAT ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 24 0 

CATATCAAGT TGATGGTTCT ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 300 

^ CTATTTATGA TGCTTTACGA CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360 

TTAAGCAAAA TGCTGGTAAT GATGCAC CTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420 

ACGAAGGCGA TATTGGTCCA TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 480 

45 CACAATTAGC ATCAGCATTT GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 54 0 

CAATTCAAAA AGTAGTTACT CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600 

AAGCGATGAG TGATTACACT GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 660 

50 CATATGGTTC TGCATATGGC CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 72 0 

CTGGTACTTA CGGTG CTGAA ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 7 80 

55 



1266 



EP0 786 519 A2 

AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG CCAACAAGAA TATCCACAGT 900 

TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA TGGCGAAGAC TTTAAACGTC 960 

CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGTTTC TGGTAGTCAA GATAACAACA 1020 

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAG CAGT GGTACTGCAC 108 0 

AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 114 0 

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATAC CAT ATTTATGGTG TAGCCGAGGA 1200 

TTTTnTTAGG TTCTT CATCT TTTATGG 122 7 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 



25 


C CAT ATGG t G 


CAACATTCTT 


CGTATTTAGT 


GATTATTTAA 


AACCAGCGTT 


ACGTTTATCA 


60 




TCAATTATGG 


GATTAAATgC 


aACGTTCATC 


TTCACACATG 


AyTCaATTGC 


AGTAGGTGAA 


120 




GATGGTCCTA 


CTCATGAACC 


AATTGAGCAA 


TTAGCTGGAT 


TAAGAGC CAT 


TCCAAATATG 


180 


30 


AATGTTATCC 


GTCCTGCTGA 


TGGTAATGAA 


ACAAGAGTAG 


CATGGGAAGT 


TGCCTTAGAA 


240 




TCTGAATCTA 


CACCTACTTC 


ATTAGTATTG 


ACACGTCAAA 


ACTTACCGGT 


ATTAGATGTA 


300 




CCAGAAGATG 


TAGTTGAAGA 


AGGCGTTCGA 


AAAGGTGCCT 


ATACAGTTTA 


TGGCTCTGAA 


360 


35 


GAGACACCAG 


AATTCCTATT 


ATT AG CTTCA 


GGTTCAGAAG 


TTAGTCTTGC 


AGTTGAAGCT 


420 




GCTAAAGATC 


TTGAAAAACA 


AGGTAAATCA 


GTACGTGTTG 


TTTCAATGCC 


TAACTGGAAT 


480 




GCATTTGAAC 


AACAATCTGA 


AGAATATAAA 


GAATCAGTTA 


TTCCATCAAG 


CGTAACAAAA 


540 




CGTGTTGCGA 


TTGAAATGGC 


TTCACCGCTT 


GG ATGG CATA 


AATATGTAGG 


TACTGCAGGT 


600 




AAAGTTATTG 


CTATTGACGG 


CTTTGGCGCA 


AGTG CACCTG 


GCGATTTAGT 


AGTTGAAAAA 


660 


45 


TATGGATTTA 


CAAAAGAAAA 


TATCTTAAAC 


CAAGTTATGA 


GCTTATAAGA 


ATAATTTATA 


720 




AAGCGAGTAT 


GTTTAGAAGT 


CTAGGATGCA 


TAATCTTAGG 


CTT CTTTTAA 


AGTGTTGAAA 


780 




TTTAGAGTAT 


AG CACTTAAA 


CTACAT CATA 


AGTGATAAGT 


TATGAAAGTA 


TACTATTTCA 


840 


SO 


GATTAATCTT 


TAAAAGCTCT 


GTTATAA CAG 


CATGATTTTT 


GATATTATTT 


TTAGTATCGA 


900 




TATTAAAATA 


CTTGAATAAA 


CTAGTTCTTG 


AAATAATGTG 


ATG AATTTAG 


TAAAATTCAG 


960 



55 
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TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 1080 

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 114 0 

5 TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATGACG ATGATGAATA AAAATATGGA 12 00 

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 1260 

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 13 20 

10 

GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 13 80 

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 144 0 

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT C CGGTAAAk G 1500 

15 

AGAAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 1560 

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAG CCTT T AT AATTGG T TTGCTTTTTT 162 0 

20 CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 16 8 0 

GATCTAAAGC ATTTCCAGTT ATTTTGATTT CATTACTTAT CATTCGTACT GTGATGAAAA 174 0 

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 1800 

25 TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 186 0 

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 192 0 

TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 1953 

30 (2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH : 3787 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

40 

ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 60 

TGTTGAAGTA GCACGTTTGC TTGAT AT AG T GGAmATaAAA GTACACGCAT TACATTCAGG 12 0 

4S TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 180 

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 24 0 

AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 3 00 

50 ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 3 60 

AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATT CTG 4 20 
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CAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT CGATGAATGC CATTCCTAGT 54 0 

GCAATTACAC AAAATAT AG G GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 60 0 

TATATACAAT TAGTTGGTAT ATTATTATTT ATTG CTATTT TTGTGTTTGC GAACAATATT 660 

GCACATATGA TGGGTGATGG CCATTTAACA CCAATGATTC AAGCTGCAAG TTTAAGCTTT 720 

ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 780 

GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 84 0 

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900 

GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGCGCACCG AC CTTTTAAG 960 

TTTAAAATGG TAAATAACAC TGCAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 102 0 

ATTTTCG CTA TCAGTCAATT AATCGTAATT TTATGGCAAG TGATTGATAG TGTTACTATT 1080 

ATTAAGTCAC TTCAAGCGAT ACGCGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 114 0 

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAA CTACAAC ATTTAGTTTT 1200 

GCGCTCATTC CTCTGTTAAG TGACGCAATC AAAATGAATA ATCAGGTACT TATGAATCGT 1260 

TATGCAAATG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 13 20 

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 13 80 

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 14 4 0 

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500 

AAATTTATAC TTAATATCAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560 

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCTGTCAC GAGAAAATAC 162 0 

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 1680 

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 174 0 

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 1800 

AATGTTTTGA CATATAAAGA GTTAACTTAT TTAC CATTTG GTTCAAAGTT GTATCAAATT 1860 

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 1920 

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATG CAA 1980 

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 2 04 0 

ACCATGTTTA TGAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 2100 

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160 

CTGAGACAAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 22 20 
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ATGATGGCTT 


CACACTGTTA 


GATGCGACAT 


CATTACAAGA 


AGTAACACTT 


AATGTTAGAA 


2340 


CGCATACATT 


GATTACGCAA 


GTTTATAGTG 


CAATGGTTGC 


TG CTAATTTG 


AAAAT CACTT 


2400 


TAATGGAACG 


ATATCCTGAT 


GATTACCCTG 


TTCAAATTGT 


CACTGGTGCA 


CGAAGCGATG 


2460 


GTGCGGATAA 


CGTTGTGACA 


TGCCCATTAT 


ATGAATTGGA 


TCATGATGAA 


AATGCATTCA 


2520 


ATAATTTGAC 


GAGTGTATTC 


GTACCAAAAA 


TCATAACATC 


GACATATTTG 


TATCATGACT 


2580 


TTGATTTTGC 


AACGGAAGTG 


ATTGATACTT 


TAGTTGATGA 


AGATAAAGGT 


TGTCCATGGG 


2640 


ATAAAGTGCA 


AACGCaTGmA 


AcgCTAAAGC 


GTTATTTACT 


TGAAGAAACA 


TTTGAATTGT 


2700 


TCGAAGCTAT 


TGACAATGAA 


GATGATTGGC 


ATATGATTGA 


AGAACTAGGA 


GATATTTTAT 


2760 


TACAAGTGTT 


ATTGCATACT 


AGTATTGGTA 


AAAAAGAAGG 


GTATAT CG AC 


ATTAAAGAAG 


2820 


TGATTACAAG 


TCTTAATGCT 


AAAATGATTC 


GTAGACACCC 


ACACATATTT 


GGTGATGCCA 


2880 


ATG CTGAAAC 


TATCGATGAC 


TTAAAAGAAA 


TTTGGTCTAA 


GGCGAAAGAT 


GCTGAAGGTA 


2940 


AACAGCCAAG 


AGTTAAATTT 


GAAAAAGTAT 


TTGCAGAGCA 


TTTTTTAAAT 


TTATATGAGA 


3000 


AGACGAAGGA 


TAAGT CATTT 


GATGAGGCCG 


CGTTAAAGCA 


GTGGCTAGAA 


AAAGGGGAGA 


3060 


GTAATACATG 


AGATTAGATA 


AATATTTAAA 


AGTATCACGG 


TTAATAAAGC 


GACGTACGCT 


3120 


AGCAAAAGAA 


GTAAGTGATC 


AAGGTAGAAT 


TACAATAAAT 


GGTAATGTTG 


CTAAAGCTGG 


3180 


ATCGGATGTT 


AAAGTTGAAG 


ATGTGCTGAC 


GATTCGCTTT 


GGTCAAAAAT 


TAGTAACAGT 


3240 


TAAAGTAACT 


GCATTAAATG 


AACATGCATC 


TAAAGATAAC 


GCGAAGGGTA 


TGTATGAAAT 


3300 


CATTGAAGAG 


CGTCGACTTG 


AAGAAGCGTA 


AATTGGAGGT 


GACAAGCAAT 


GAAAAATAAA 


3360 


GTAGAA CATA 


TAGAAAATCA 


GTACACGTCG 


CAAGAGAACA AGAAAAAACA 


ACGTCAAAAA 


3420 


ATGAAAATGC 


GTGTTGTTCg 


TAGGCGTATT 


ACAGTATTTG 


CGGGCGTATT 


a CTTG CGATA 


3480 


ATTGTTGTTT 


TATCaATCTT 


GCTTGTTGTC 


CAAAAACATC 


GCAATGATAT 


TGATG CACAG 


3540 


GAGCGAAAAG 


CGAAAGAAGC 


ACAGTTTCAA 


AAGCAACAAA 


ATGAAGAAAT 


TGCGTTAAAA 


3600 


GAAAAGTTGA 


ATAATCTGAA 


TGACAAAGAT 


TACATTGAAA 


AAATTGCGCG 






TACTTAAG CA 


ACAAAGGTGA 


AGTGATTTTT 


AGGTTGCCAG 


AAGACAAAGA 


TTCGTCTAGC 


3720 


TCAAAATCTT 


CGAAAAAATA 


AATCCAAATT 


GATTCAAAAT 


TATCCGAGTA 


TAGACATTGT 


3780 


GAAAAAA 












3787 


(2) INFORMATION FOR SEQ ID NO: 376: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

5 TAAACCATTT CAAACTGAGG AACGCnAAGA CGGsACgTTT CCAGATTTAG AAGTATTTAA 6 0 

AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 12 0 

ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 18 0 

10 

TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 24 0 

TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC T AAT CAATGA 3 00 

TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 3 60 

15 

TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 4 20 

CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 4 80 

2 0 AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 54 0 

ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 600 

ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 660 

25 AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 72 0 

CATCGTTGTG GGTCAATGTG CATGGT CAT A ATAACAAATA CTTnAATAAG GTAATTAAAA 78 0 

AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 840 

30 AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 900 

ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 960 

TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 10 2 0 

GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 108 0 

ACTTAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATT A CG AT A 114 0 

CTGAAAATGA AATGATGACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 1200 

40 

ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 126 0 

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 13 2 0 

45 TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 1380 

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 144 0 

CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATT CGCATG 1500 

50 GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA T CAAAT cGTT TGTa CGGTTG 156 0 

cATTaG aAAA T ATaAG a CTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 162 0 

55 
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(2) INFORMATION FOR SEO ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

to 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 
ATG ATTTT t A aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 60 

^ TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 120 

CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180 
CACGCmTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC 240 

20 AAAGCGAGAA CAAAACATAA CAT CAAA CGA ATCGACCGAT TGTTAGG TAA TCGTCACCTC 3 00 

CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGG CAATACG 3 60 

aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 4 20 

25 CGAGCTTCAG T 431 

(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 00 6 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 



TTTnTATAAC 


G TATTATAAA 


TCGTTAAAAA 


TTTTGGTTGT 


GTTTGCGTCA 


CGTAGACAAC 


60 


CTCCATAAAG 


TTACTTAATC 


ACTCTCATCA 


TACAATAATT 


TTTACTCAAA 


TTGGAAnAAT 


120 


TATAAAAATT 


AAATATAGAT 


AGGCTTTGAA 


AATTAGTTTT 


ATACAAGGTT 


AGTAGCTGTA 


180 


ACTGTAAAAT 


GTTCTTAATA 


TTGTCAAAAT 


GTAATGCTTG 


AAAGCGCTTT 


TAAAaAATAT 


240 


TATTATATAC 


ATGGTTAGAC 


AAATAGACAA 


ATCACTATAC 


AAATATTGGG 


AGGAATATTT 


300 


TATGAAATCA 


ACACCACACA 


TTAAAC CAAT 


GAATGACGTC 


GAAATTGCAG 


AAACGGTTCT 


360 


ATTGCCAGGA GATCCGTTAA GAGCTAAGTT 


CATTGCAGAA 


ACTTATTTGG 


ATGATGTGGA 


420 


ACAGTTCAAT 


ACAGTGCGAA 


ACATGTTTGG 


TTTTACCGGA 


ACATATAAAG 


GTAAAAAAGT 


480 


TTCTGTCATG 


GGTTCAGGTA 


TGkGTATGCC ATCTATTGGC 


ATTTACTCTT ATGAATTAAT 


540 
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CATTGATTTA TATGATGTGA TTaTTkCACA AGGTGCCTCT ACTGATTCAA 


ATTACGTTCA 


660 




ACAATATCAA TTACCAGGTC ATTTTGCGCC AATTGCTTCT TATCAATTAT 


T AG AAAAAG C 


720 


5 


AGTTGAAACA GCACGTGACA AAGGTGTACG TCATCATGTA GGTAATGTGT 


TATCAAGTGA 


730 




TATTTTCTAT AACGCGGATA CAACAGCGAG TGAACGTTGG ATGCGTATGG 


GTATTTTAGG 


840 




TGTAGAAATG GAATCaGCTG CaTTATACAT GaATGCAaTT TACGCTGGTG 


TCGAAGCATT 


900 


W 


AGGTGTGTTC ACAGTGAGCG AT CATTTAAT TCATGAAACG TCAACAAcAC 


CTGAGGAAAG 


960 




GGAACGTGCA TTTaCAGATA TGATTGAAAT TGCACTGTCA TTGGTGTAGA 


TGATTATGAA 


1020 


15 


TGTTGAATAT TCTAAAATAA AGAAAGCAGT ACCTATTTTA TTATTCTTAT 


TTGTATTCAG 


1080 


TTTGGTTATA GACAACTCAT TTAAATTGAT TTCTGTAGCC ATTGCTGATG 


A CTT AAA CAT 


1140 




ATCTGTAACG ACAGTAAGTT GGCAAGCGAC ATTAG CCGGT TTAGTAATTG 


GTATTGGCGC 


1200 


20 


TGTAGTATAC GCTTCATTAT CTGATGCCAT TAGTATACGC ACACTATTTA 


TTTATGGCGT 


1260 




GATATTAATC ATTATCGGAT CAATTATTGG TTACATTTTC CAACATCAAT 


TCCCATTACT 


1320 




TTTAG TTGG A CGTATTATTC AAACTGCCGG TTTAGCTGCT GCAGAGACAT 


TATATGTGAT 


1380 




ATATGTTGCA AAGTATCTTT CTAAAGAGGA CCAGAAGACT TAC CTTGGCT 


TAAGTACGAG 


1440 




CAGTTATTCC TTGTCATTAG TTATCGGTAC ATTATCAGGT GGATTTATTT 


CTACGTATTT 


1500 




ACACTGGACA AATATGTTTT TAATTGCATT AATCGTAGTA TTTACGTTGC 


CATTCCTATT 


1560 


30 


TAAATTATTA CCAAAAGAAA ATAATACGAA TAAAGCTCAT TTAGATTTTG 


TTGGCTTAAT 


1620 




TCTAGTGGCA ACTATTGCTA CAACAGTCAT GCTGTTTATT ACGAACTTTA 


ATTGGTTATA 


1680 




TATGATTGGT GCCTTAATTG CGATTATCGT TTTTGCGCTA TATATTAAAA 


ATGCGCAACG 


1740 


35 


TCCATTAGTA AATAAATCAT TTTTCCAAAA TAAACGTTAT GCTTCATTTT 


TATTTATAGT 


1800 




ATTTGTAATG TATGCTATCC AATTGGGTTA TATTTTTACG TTCCCATTCA 


TAATGGAGCA 


1860 




AATTTAT CAT CTGcAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT TaTATAGTAG 


1920 


40 


CAGTCATTGT TGGCGgCACT AAGTGGgTtA AAATCGGgCG rAATATCTGG AATTCCAAAA 


1980 




CCAAGCGGAT TATCACAGCC AATTAA 








(2) INFORMATION FOR SEQ ID NO: 379: 






45 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 







SO 
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TGGGGAAAAA AAGACCCAGC AGTATTAGAA GAATCGTTAA ATATTTCTAT TGAAGAAATG 60 

AATCGTATCA TAAAATTAGT CGAAGAATTA CTTGAATTGA CTAAAGGAGA TGTAAATGAC 120 

ATTTCTTCTG AAGCACAGAC CG TG CAT ATT AATGATGAAA TTCGCTCGCG AATACACTCA 180 

TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 240 

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 3 00 

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 3 60 

AT AA TTG AAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTTT 4 20 

GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 4 80 

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTGAA 54 0 

ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 6 00 

T CAT AGGAT C AATTTTTTAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 660 

AACAAATAAA CAGTGGTTCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 720 

CAT CTAT AAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 7 80 

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CGTTTCAACA 84 0 

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTTTCAGAGG CTCCTGTAAA 90 0 

CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 96 0 

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 102 0 

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 10 8 0 

TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 114 0 

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC TTTGATTTAG ATCAACAGAC 120 0 

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 126 0 

TTATGAAGCA ATTTTAAGAA TGGAAAAACG TTACAAAGGA C CAATTG CAT TTGAGTATAC 13 20 

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 13 80 

AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAG CGT ATGTTGAAGG 144 0 

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 1500 

AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 156 0 

AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTTTTAA CG C ATGTCTTAGA 1620 

AAAACCGTAC GAAATGATGA TTTCAGAATT TATG CAT ACA GATCCAATGA AATTCTTAC C 1680 

TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG 1740 
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AAGTCACTTG GAAATTGTTG CACCTGTTGT TGAGGGGCGT ACGAGAGCAG CACAAGATGA 1860 

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 192 0 

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 19 80 

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2 04 0 

TACAGAACCA ATTGATGCAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 

TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 216 0 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220 

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 22 80 

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAG CTTGTTA ATGAAGGTGT 2 340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 24 00 

T CAT GAT AAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGCAGA 24 60 

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2 520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2 5 80 

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGG CACAAGC 2640 

AGAACAACTT G CATTTG CG A CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760 

TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCG CTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2 8 80 

AAAAAGCTTC AATATTTGGG AAG CACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2 94 0 

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 30 00 

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 30 60 

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120 

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3 24 0 

ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 33 00 

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAG AAG CAT TAGCTAAAAA 33 60 

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 34 2 0 

GATTGAAGCA TTACTAG CAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 34 80 

ACCTAAAAAT CAAGGTG CAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3 54 0 
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3 660 

GGAAATAAGT CATGC CAGAG GTTAAAGTTC CAGAATT AG C AGAATCTATT ACAGAAGGTA 3 7 20 

5 CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3 7 80 

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3 840 

AACAACTTGC AAGTGAAGGC GACACTGTAG AAG TTGGACA rGCAATTGCT ATCATCGGCG 3 9 00 

w 

AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACT CCA CAA CAAAATGAAG 3 960 

AAACAAATAA TAAAAAAGAA G AAA CAA CAA ATAATTCGGT AG AT AAAG CT GAAGTAAATC 4 020 

)s AAGCAAATGA TGACAATCAG CAACG TATTA ATGct ACGCC TTCTGCGCGT CGATATGCTC 4 0 80 

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTG GTTCGTAAAG 4140 

AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAACACA AACAACACAA CAAGCATCTG 4 2 00 

20 CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATT CGT GAAAAAATGT 4260 

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 4 3 20 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 43 80 

25 ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 444 0 

TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 45 00 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4 56 0 

30 

ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 462 0 

TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 46 80 

3S TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 474 0 

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTGATCAAGA 4 800 

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4 85 8 

40 (2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2222 base pairs 

(B) TYPE: nucleic acid 
4S (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 80: 

50 

ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 6 0 

TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 12 0 
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ATGCTATTTA GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTT CAACTG GAAATATGTG 24 0 

GCGATACTTA GTTGCAATTA TTG C CTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300 

5 TATCATCGGT ATAATTATTG G TTTAATG AT TTCATATGTT GTAGTTATCA TACGTCCTTT 3 60 

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 420 

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 4 80 

w 

CTTG TTACGG cGTTTCTTGT TTTTCTACTT GcTATCATTT GTACGCGTAA TTTGAAAAAA 54 0 

AGAC CAACTG GCAAACAAAA TTTCGTTGAA TGGATTTTTG ATTTCGTGAG GGGAAT CATT 6 00 

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 66 0 

15 

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720 

ACATTGTGGT GGAAATCACC GACAGCnGAT GCAACAGTGA CTTTAACGTT GTCTACAACG 780 

20 ATAATACTGT TAACTCACTT TTATGGAATT AAAATGCGTG GTACGAAACA AT AT CTTAAA 84 0 

GGTTATGTAC AGCCGTTTTG GCCATTGGCA ATTATTAATG TTTTTGAAGA GTTCACTTCA 900 

ACATTAACG C TTGGTCTGCG TTTGTACGGT AACATATTTG CAGGTGAGAT ACTATTAACA 960 

25 TTACTTGCTG GCTTATTCTT TAACGAACCA GCATGGGGTT GGATTATTAG TATCCCAGGA 1020 

TTAATTGTTT GGCAAGCATT TTCAATATTT GTAGGAACAA TCCAAGCATA TATCTTTATT 1080 

ATGCTTTCGA TGGTTTATAT GTCACATAAA GTGGCAGATG AACACTAAAA ATTTCAATAA 114 0 

30 

TTATATACAA TCACAGGAGG AAATTAAATT ATGAATTTAA TCGCAGCAGC AATCGCAATT 12 00 

GGTTTATCAG CATTAGGAGC AGGTATCGGT AACGGTTTAA TCGTTTCAAG AACAGTTGAA 1260 

GGTGTAGCAC GTCAACCAGA AGCACGTGGT CAATTAATGG GTATCATGTT CATTGGTGTA 1320 

35 

GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 13 80 

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440 

40 TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACG CACCATG CCTATCGCAT 1500 

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560 

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620 

45 GAGG CGTTGA GTGGGGTACT GTGATTGTAC AGGTCCTAAC TTTCATCGTG TTAcTTGCGT 16 80 

TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 174 0 

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800 

SO 

AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860 

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920 

55 
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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGAAA 2 04 0 

TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 

AATGGTAAAA GTAGCTAACA AGTATgcTAA AG CATTATTT GACGTGTCAT TAGATACAAA 2160 

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220 

TG 2222 
(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 6 0 

TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 120 

ATTAT AT GCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 18 0 

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 240 

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 3 00 

GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 360 

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 420 

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 4 80 

ATAGG CTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 54 0 

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 6 00 

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGG TGGTT GTTTGTTAAA 660 

TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720 

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 7 80 

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ATTTGTTAAT 84 0 

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 900 

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 960 

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020 

GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 1080 
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ATT 

(2) INFORMATION FOR SEQ ID NO: 3 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 506 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCACC AACAG CATGC 6 0 

CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 12 0 

TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 180 

CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 24 0 

ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 300 

GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 36 0 

AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 42 0 

AGCCAGTGTG cACTTg TCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 4 80 

TCAAATACAA ATTCGTTAAC GTCACT 506 
(2) INFORMATION FOR SEQ ID NO: 3 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60 

TAC CATTTTC ATCTTTTACC G AAATAG TAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 12 0 

TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGG CAAC TATTAATCCC AATATACTAA 18 0 

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 24 0 

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 30 0 

ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360 

GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 42 0 
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(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

TGCACTTGAT ACTTTAGCAC ATGAAATTGC TAATCGATTA GTTGCAAATG ACAAGAATGA 60 

AGCAACTTTG GAAATGACTA ATAAAATGGC AACGATTCGT TTTACAGAAC CTACGCTGAT 120 

TGCATTAGCA GGGGGTAATG TCAAAGCTTA CACTGAG CAT ATGACTATAT CTCCATATAA 180 

ATTGTATTTG TTAGATAAAG GCGATGTTTT AAAGTTTAGA GAAACAAGTT ATACATCGCG 24 0 

AGTGTATTTA GCTGTgGGAG GCGGATTTGA ATTAGATGCA TGGTTAGGAT CTAACTCAAC 3 00 

CGACTTTAAT GTAAAAATTG GTGGTTTTAA AGGTAGAACA TTACAAGATG GCGATGAAAT 360 

AAAGCTTAAG AGAGATTATA CAGCTCGTCA TCATAAGTTA TTTGAAAACC TTGCTCACAC 420 

GAAACAAACA GATTGGGGTA TTGATGGATA CGCCTTGTCA TTTAATTATA TGTCTGATGT 4 80 

ATTT CATGTC GTTAAAAATA AAGGTACGGA AGATTTTAAA GAAGATGCCA TT CAAAG ATT 54 0 

TGTGAAACAT GATTATAAAG TAACGAGCAA AGCAAATCGC ATGGGGATGA TGCTTGAAGG 600 

TGAAAAAATC AAAGCTTTTT ATGAAGATAT GCCACCGTAT CAGACTGTCA AAAAAGGAAC 660 

GATACAAATT AAGCGTGATG GCACACCTAT TATCCTATTA AATGATCATT ATACGCTAGG 720 

TAGCTACCCG CAAATCGGTA CAATCGCAAG TTATCATTTA ACGAAATTAG CACAAAAACC 780 

GCAAGGATCA CGTTTGAAAT TTCAATTTAT AGATATTTTA ACGG CTG AAA AGAACCTTGT 84 0 

TAAgTATAGT AACTGGTTAA AC 862 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA sATAAACGGT 60 

CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 120 
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240 
300 



CAGTTTTACG TCCTACACCT GCTAAACTTT CTAATTCCTT ATGTGTTTGT GGTATTTCTC 
CATTAAATTG ATCAATCAAA GATTGACAAA GTTTCTTAAT ATTCTTAGCT TTGTTACGAT 

ACAGACCGAT AGAACGAATA TCATTCATAA GTTCTTCATC ACTGACTGCC AAATAATCTT 36 0 

CAGGCGTTTT GTATTTTTTA AACAGCTCAG TTGTTACTCT ATTTACTAGA ACGTCTGTAC 42 0 

ATTGCGCTGA CAATAATACA GCAATAGTTA ATTCGAACGG ATTATCATGT TTTAATTCAC 48 0 

ATT CTG CATC CGGAAACATA TTTGCTATAA CATCAATCAT TTCTAATGCT TTTTTCTTAC 54 0 

TTACCATCAA GGTTCTCCCC GTTTAACCAA TCAAATTTAG GTACCGTTTT AACTGTGTGC 600 

GTCATTTTCG GTTTATTGAA TTTTTCTCTT ATTTTTCTAG AATCGTCAAT TGTTTTGACA 660 

TTGTTTTTCT TCCAATTAAG TAAAATACGA TCTATATATT TAAAGCTAAG TTTATTCAAA 720 

CTATTCGCCT CGTCTAATGC CGCTTGTATA ATTGCAGTAT CGTGTTTATC AACATCAATC 780 

CATTGATTTA ACGTTTCTAT TTCATATGGA GATAACGGCC TTGCAAATGT ATCCTCTAAA 84 0 

ACTCTAAATA ATTGTTTAAA TTTTTCTTTA CTATTTAGCT CTTTCGTTTC CATACTTTGT 900 

TGCTTCAATA TATGACTTAA TTTTTCGAAA AAAGGATCTA GATTCATATA TTCGGkAAAT 960 

CTACCTTCyT CATCyTTTTG aACTkGtAAT tCTAGCAATT CACGTgTATC AAATTTTGGA 1020 

TACCATT 1Q27 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1006 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

AAGGnTTGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC G CATGGAATG 6 0 

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120 

GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180 

CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 24 0 

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 3 00 

ATCATG TTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360 

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 42 0 

GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 4 80 
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GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT G AAAG AG CAT GGTGAAAAaG 600 

TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 660 

GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAG C 720 

AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GT CGATAAAT 7 80 

CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 84 0 

GTACTGATAA TACCCAAGAT GGTGCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 900 

ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 960 

GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 1005 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 60 

GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 120 

ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 180 

CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 240 

CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 3 00 

ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 3 60 

CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 4 20 

TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTG CT ACATCGTCGA CGCTATATTC 4 80 

CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 540 

CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 600 

AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 660 
TT 

(2) INFORMATION FOR SEQ ID NO: 3 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



662 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

g TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 6 0 

TATAACAAAT CCCTCCTCTC ATTTTTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 12 0 

CTAAACTGCA ACGGTTCCTA ATAC CAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180 

w ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 24 0 

CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 3 00 

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 360 

15 CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 4 20 

GCGATTTACA TGGTGAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 4 80 

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 540 

TAACTGCTCT TGTCTACTAT CCAGAAGrCm AATTtAAAAT TgATTAAAAG TGATTTCCAA 600 

AATTgCGGtC mACtTAATGt CyGGtATATC ACaACmATCG aACATTTAAT TGAGTTAATT 660 

AAATATTGT 669 



(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 9 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGG CAATTTT 60 

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGAAAT GGTG CAAAAT 120 

40 

AAAACAGGTC ATATGAGTCA ACAArGC CAT ATGAATCAGA ACACACATGA ACCAACAGCC 180 

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240 

4s TATGCATTCA T CAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 300 

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 3 60 

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA ACATAATACG ATTAATAATA 420 

50 GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 4 80 

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 540 

£5 
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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT 66 0 

AAT CATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 72 0 

5 ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 7 80 

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 84 0 

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900 

AAATGCATCA GCTTGCTTAG AATGATTGTC CTTTTTTTGA TAATAGCGTT CCATTGCAAT 96 0 

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 102 0 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080 

15 

TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTG TGGA ATAACAAAGT GTCCCTTCCC 114 0 

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 120 0 

20 ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 124 9 

(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1788 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 6 0 

GATTGTAGGT TATGT CATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120 

35 

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180 

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 24 0 

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300 

40 

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360 

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420 

45 ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 4 80 

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 54 0 

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 6 00 

50 ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG C CGAAACTAT TATTGAAAGG 6 60 

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720 
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AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 84 0 

AACAAGTTCG ATATCTAAAG TGAG CTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 

5 GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 96 0 

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATG ATT C 1020 

TTGGAG CTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CGCAACAAAA 1080 

to 

AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 114 0 

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 12 00 

TTTCTTAATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 12 60 

75 

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320 

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 13 8 0 

2g TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAC ACTGGTGTAA ATCAAGGATT 144 0 

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 

GCGTTTGTAT CACTTTGCTA TGGAACG TAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 156 0 

25 AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 162 0 

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 16 8 0 

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 174 0 

30 tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788 

(2) INFORMATION FOR SEQ ID NO: 3 91: 

(i) SEQUENCE CHARACTERISTICS: 
3S (A) LENGTH: 24 07 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 6 0 

4S GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TAT CAGGAAT ATTATATAAA 12 0 

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAG CATTTTT 180 

ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 24 0 

SO TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 3 00 

TTTGGAG TTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 3 60 
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TATGG CATTG GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCGAA GAAAATCTCA 4 80 

ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 54 0 

5 TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 60 0 

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTCGGT 660 

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 72 0 

10 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATATTAT ACTCATTTTA 780 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCA C 84 0 

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 900 

IS 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 96 0 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 102 0 

ZQ GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTG CTG GGCACTTTTA GTCG CCTTG A 108 0 

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 114 0 

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200 

25 GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGTTTA ATCACATTCG 126 0 

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 1320 

TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 13 8 0 

30 AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 144 0 

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTGTCTTT 1500 

GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 156 0 

35 TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 1620 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 1680 

TTG CATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 1740 

40 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 1800 

AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 1860 

4S TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920 

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 198 0 

GAAGATGAAC AGCCGATGGT CGCT CAT ATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 204 0 

50 ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAG GCATAG ACTTAAATAT GGGATGTCCT 2100 

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 
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GGCTACTATG AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA ACAAGACATT 22 8 0 

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 234 0 

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 24 0 0 

ATTAACG 24 0 7 



(2) INFORMATION FOR SEQ ID NO: 3 92: 

(i) SEQUENCE CHARACTERISTICS-. 

(A) LENGTH : 24 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 3 92: 





ATGATGnATG 


GCnCCGCCAA 


GAAGTTGAAC 


CAGTCTATTG 


GCTAGAATTG 


CTCAAAAAGC 


60 




GAGAcaTGTG 


GTATTCaTAT 


GTTaGTAgCt 


ACGCAAAGAC 


CATCTGTCAA 


TGTAATTACA 


120 




GGTTTAATTA 


AAGCCAACAT 


ACCAACAAGA 


ATTGCATTTA 


TGGTATCATC 


aAGTGTAGAT 


180 


25 


TCGAGAACGA 


TATTAGACAG 


TGGTGGAGCA 


GAACGCTTGT 


TAGGATATGG 


CGATATGTTA 


240 




TATCTTGGTA 


GCGGTATGAA 


TAAACCGATT 


AGAGTTCAAG 


GTACATTTGT 


TTCTGATGAC 






GAAATTGATG 


ATGTTGTTGA 


TTTTATCAAA 


CAACAAAGAG 


AACCGGACTA 


TCTATTTGAA 


360 


30 


GAAAAAGAAT 


TGTTGAAAAA 


AACACAAACA 


CAATCACAAG 


ATGaATTATT 


TGATGaTGTT 


420 




TGTGCATTTA 


TGGTTAATGA 


AGGACATATT 


TCAACATCAT 


TAATC CAAAG 


ACATTTCCAA 


480 




ATTGGCTATA 


ATAGAGCAGC 


AAGAATTATC 


GATCAATTAG 


AG CAACT CGG 


TTATG TTTCG 


540 


35 


AGTGCTAATG 


GTTCAAAACC 


AAGGGATGTT 


TATGTTACGG 


AAGCAGATTT 


AAATAAAGAA 


600 




TAATTATGAG 


TAAGGAGTTT 


TATATAATGA 


CACACTATCA 


TTTTGTCGGA 


ATTAAAGGTT 


660 


40 


CTGGCATGAG 


TTCATTAGCA 


CAAATCATGC 


ATGATTTAGG 


ACATGAAGTT 


CAAGGATCGG 


720 




ATATTGAGAA 


CTACGTATTT 


ACAGAAGTTG 


CTCTTAGAAA 


TAAGGGGATA 


AAAATATTAC 


780 




CATTTGATGC 


TAATAACATA 


AAAGAAGATA 


TGGTAGTtAT 


ACAAGGTAAT 


GCATTCGCGA 


840 


45 


GTAGCCaTGA 


AGAAAtAGTA 


CGTGCACATC 


AATTGaAATT 


AG ATGTTG TA 


AGTTATAATG 


900 




ATTTTTTAGG 


ACAGATTATT 


GATCAATATA 


CTTCAGTAGC 


TGTAACTGGT 


GCACATGGTA 


960 




AAACTTCTAC 


AACAGGTTTA 


TTATCACATG 


TTATGAATGG 


TGATAAAAAG 


ACTTCATTTT 


1020 


50 


TAATTGGTGA 


TGGCACAGGT 


ATGGGATTGC 


CTGAAAGTGA 


TTATTTCGCT 


TTTGAGG CAT 


1080 




GTGAATATAG 


ACGT CACTTT 


TTAAGTTATA 


AACCTGATTA 


CGCAATTATG 


ACAAATATTG 


1140 
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TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT CTACGTAAAA 1260 

TTGAAGCAGA TGTTCCAATT TATTATTATG GATTTAAAGA TTCGGATGAC ATTTATGCTC 13 2 0 

5 AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT GTATGTGGAT GGTGAGTTTT 13 80 

ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA TTAGCTGTAA 144 0 

TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT TAAAGAAGCA TTAGAAACGT 1500 

10 

TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC AAATCAAGTT ATTGTAGATG 156 0 

ATTATGCACA CCAT CCAAGA GAAATTAGTG CTACAATTGA AACAGCACGA AAGAAATATC 162 0 

CACATAAAGA AGTTGTTGCA GTATTT CAAC CACACACTTT CTCTAGAACA CAGGCATTTT 1680 

IS 

TAAATGAATT TGCAGAAAGT TTAAGTAAAG CAGATCGTGT ATTCTTATGT GAAATTTTTG 174 0 

GATCAATTAG AGAAAATACT GGCGCATTAA CGATACAAGA TTTAATTGAT AAAATTGAAG 1800 

20 GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA ACAATTTGAT AATGCTGTTA 1860 

TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA TGCATATTTA GATAAATTAG 1920 

GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT ATGGGTATTT ATTATTAATG 1980 

25 ACATTATTAC ATGTTAATTA GGAGGCGTTT TTAATGGATT GGATTTTACC AATTGCTGGA 204 0 

ATTATCGCTG CGATTGCATT CTTAATTTTA TGTATCGGTA TCGTAGCTGT ATTAAATTCT 2100 

GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGACG GTGTAGAAGG TCAAGTTCAA 2160 

30 GGTATTACTC GTGAAACAAC AGATTTACTT CATAAAGTAA ACCGTTTAAC TGAGGATATC 2220 

CAAGGTAAAG TAGATCGTTT AAACTCAGTT GTAGATGCTG TTAAAGGTAT CGGTGACTCA 2280 

GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT CAATTACACA TAATATTTCT 234 0 

35 

CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA ATGTTGCAAT GGAAATTGCA 2400 

GACAAATGGC AAAATAGACA CTAC 2424 
(2) INFORMATION FOR SEQ ID NO: 3 93: 

40 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 93: 

50 CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 60 

GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGG CAGA 12 0 
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AGTGTAAAGT TTTCAACATA ATACTATTAG TTCGGTCATG TATCGGACTG ATGGAAAAGC 24 0 

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 3 00 

5 GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 36 0 

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 42 0 

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 4 80 

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACGATAT GAAGCGTATT ATGGTAAAAT 54 0 

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600 

1S TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 660 

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720 

TTTGCAAAAT TGATGTTA 73 8 
20 (2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1270 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 

25 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94: 

30 

AAAAGTTGTA ATTAAAAGTG GGATTTTACT TAAGnnAGAA GGAAACTATT TATATGACTA 60 

ATAAAAGAGn AGATGTCCGC AATAT AG CAA TTATTGCTCA CGTTGACCAT GGTAAAACAA 120 

CTTTAGTAGA TGAGTTGTTA AAACAATCTG GTATATTCAG AGAAAATGAA CATGTCGATG 180 

35 

AACGTG CAAT GGACTCTAAC GATATCGAAA GAGAGCGTGG AATTACGATT CTAGC CAAAA 24 0 

ATACGGCTGT TGATTATAAA GGTACACGTA TTAATATTTT GGATACACCA GGACATGCAG 3 00 

40 ACTTTGGTGG AGAAGTAGAA CGTATTATGA AAATGGTTGA TGGGGTTGTC TTAGTAGTAG 3 60 

ATGCGTATGA AGGTACAATG CCTCAAACAC GTTTTGTACT TAAAAAAGCG CTAGAACAAA 4 20 

ACCTGAAACC TGTTGTTGTT GTTAATAAAA TTGATAAACC ATCAGCACGT CCAGAGGGTG 480 

45 TTGTAGATGA AGTTTTAGAT TTATTTATTG AATTAGAAGC AAACGnTGAA CAATTAGAAT 54 0 

TCCCTGTTGT TTATG CTTCA GCAGTAAATG GTACAGCTAG CTTAGATCCT GAAAAGCAAG 6 00 

ATGATAATTT ACAATCATTA TATGAAACAA TTATTGATTc ATGTACCAGC T C CAATTGAT 660 

50 AACAGTGATG AGCCCATTAC AATTTC CAAG TAGCATTGTT GGACTACAAT GATTATGTTG 720 

GACGTATTGG TATTGGTCGT GTATTCAGAG GTAAAATGCG TGTCGGAGAT AATGTATCAC 78 0 
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 900 

TGGAAGACAT TAATGTTGGT GAAa CTGTAA CACCACATGA CCATCAAGAA GCATTG CCAG 960 

TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 102 0 

CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 10 80 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 114 0 

CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 1200 

AATTACAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 126 0 

CATTTGAACG 12 70 
(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 

AAGACCAGGA GAACAmGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGGAATT 60 

TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACG CAACTG CTGTnGCTTT AGAATATTTA 120 

CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAC CAGGTGCTAG AaCAGCAATA 180 

ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 24 0 

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 3 00 

CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAGCATT 360 

GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CATTTTAGGA 420 

TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATGATT ATTTTGGTGG TAAAAAGACA 4 80 

GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 54 0 

GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TGATCCTACT 6 00 

CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660 

GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720 

AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 7 80 

TAC CAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 84 0 

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATGACAGT GGACTAGAAG 900 
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GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATA CAACT GATCGTCGTG 102 0 

CGCAATTCGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 1080 

GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GGATATGAT C 114 0 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 

GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 126 0 

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 1320 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 13 65 
(2) INFORMATION FOR SEQ ID NO: 3 96: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs 

(B) TYPE-, nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



25 


AATTCCTGGT 


GCAATAATAA 


ATAGGATGAA AAAGATTCGG AAAATATGAT AACTCGTAAT 


60 




CATAGCAACA 


TCGGCACCAG 


TAGCTAATGC AACTAAAACT ATCTGATTAA CCCCTCCTGG 


120 




TGCTGCACCA 


AGaAACAATT 


CATTAATAGG ATTATTATCA AAGAAATGTA TGATATAAAC 


180 


30 


CATGATTAGC 


GCACCAATTA 


TCAACATAAT ATTTTGAATT GTAATTGCGA TTGCTAGTCT 


240 




ACCTTTTAAA 


TCTGACAATA 


AATGCGCAAT TTGAACTCCA ATTCTAATCA TATATATTAG 


300 




TTGTGCCATG 


TTCAACAACC 


AATGATCTAG TGTAAATGTT AAACCTGTAG AAAAATTC CA 


360 


35 


AACAATTAAT 


ACAATGAGTG 


GTGCTAATAA TTGAAATGTT GGAAACTTTA TTTTAGACAT 


420 




AATTAGATAA 


ACTATAAAGA 


TAGCTATCGC TAAAATAACT ATTTGCCCTA TGTTTAATAC 


480 




TTGTGATAAA GGCAAGACTT 


TTGTTAACTT TCCATTCGCA TGCATGTTAC CATCATGAAA 


540 


40 


AAAATATGAA 


ATGAACGGTA 


CTAAAACAAC AACAAATATA ATTCGTGATG TTTGCGTTAA 


600 




GCTAACAACT AACAAATTAG 


CACGTTTGTC TTGTTCAGCC ATGACCAGCA TTTGTGTTAG 


660 


45 


TGCTCCTGGT 


ATAACACTTA 


AAATAGCTGT TTCTGTATTA ATACGTGCAA TTTTTTTAAA 


720 


AACAAATGCC 


ATTACTATTG 


CAATTAATAA TATCGAAATA GATACAACAA TAAT CGAAAG 


780 




CCAATTGTTT 


TTAATATCCA 


TAACGACATT TTTCGTAAAC GTTGATCCGA TTTGCACACC 


840 


50 


TAATAGTACA 


ATACCTAATT 


CACTAAGTAA GAATGGCCAT TTAATATCAA GTTTGAAAAC 


900 




TTTTACACAA 


ATGATTGATG 


CGATAATAGG ACCAAACATA AATGGAAGTA ATACGTGCGA 


960 
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 



1080 



AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT t TGACTACAA AG CATATATG 



1140 



GaCTTTCACT ATCAAGTCGC CGCCCATGCC TT ATATA CAT TTAAAAnGAG CCTGAACAAA 



1200 



GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 



1260 



TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA 



1320 



AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TG CAATAATT 



1380 



TCC 



1383 



15 



20 



25 



30 



40 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 415 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 6 0 

ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGG CGC 12 0 

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 18 0 

GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 24 0 

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 3 00 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 36 0 

TGGTTAATGA AGCGGAGAAA CGTTT CATTA TGGAAAATCG TGATATCGTA GCTAC 415 
(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 398: 

TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TT cGCCTTTT 60 

aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG AC CAGAATAA 120 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 
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AATGTGGGGT 


CGTATAAGAG 


CTTGTAATG C 


TTTTTGGCGT 








CCTGCCCACA 


CAGATCGTGC 


ATAGT CAACA 


G AAAGA CC TG 


CAT CGTTTAA 






CTGAAAGCAG 


TACATTGCGT 


AGTAGTGATT 


TTGCCAATAG 


GGATATTTCC 


GAACCTTTCT 




TTTATGTGAG 


TATTATATTC 


TGTAGTTCGC 


TTTTCTATTG 


AGCGTGCAGA 


AAGATTTTCA 




TTTTTTAAAC 


GATCAAAAAA 


TATATATTCA 


AAGGGTTGAT 


TGTCCGAGTA 


TCCATATTTA 




ACATTTTGTA 


TAAATTCGCT 


TTCAGCTAGT 


TTGGCATCTT 


TCTTACGTTC 


AAACCCACGC 




TTCATTTTTC 


GTTTGTTATT 


ACCGTATACA 


TCTTTATATC 


TAATGGAAAA ATACCATTTA 




CCTGTATTAT 


CATCCTTATA 


TACTGGCATT 


TTGCTTCTCC 


CTCCTCAAAA 


TTGGCAAAAA 




ATAATAAGGG 


TAGGCGGGCT 


ACCCGAAATT 


TAGTACTAGG 


TACTAAATGT GATATAATAA 




AATAAAAAGT 


AGGTGATGTT 


ATGACATTTA 


AAAACAATCA 


TAATTTCAAT 


GAATTAGTTT 




TAACGAATGA 


AGACATTAGA 


ATTTTAAAAA 


ATGTCTTAGA 


AGATGCAGTC 


AGTGTTTATG 


900 


ATGAATATTC 


GGTATGTAAT 


GAAGAATCCG 


ATTTTGCTTA 


CTGTTTATTA 


AGAGACTTAT 


960 


ATACATTAGA 


CAGCTTAGCT 


ATTTCGTCAA 


ATAATGTTTG 


AATTATCGAA 


TTGTACTCTT 


1020 


CGATTTTAAT 


ACCATGCATA 


ATAGAGTTTC 


TGTGTTCAAT 


AGCAGCTTTG 


ACTGAATGTk 


1080 


TTAAATGTTC 


TTCTATTAAA 


TCGTTGTTTk 


CCAtTTCGtk 


TAaAAATGTt 


CyTATATTCC 


1140 


T 












1141 



30 (2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
35 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

40 ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60 

CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 12 0 

ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 18 0 

45 

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 24 0 

ACAGGACGCT TTATTGCAAC TG CAT CAAT C GTATTCTCAA TATTATTGAT TATTCATTAC 3 00 

so TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360 

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 4 20 
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ATGTTTGCAG TTTATGTATC AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 54 0 

CAAGCAATAA AAG CTG CACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 60 0 

5 GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 66 0 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 706 
(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
is (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

20 ACACAATCTG AAGATTCACG TTGTGGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 6 0 

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120 

ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACT C CAGAGT TCAAAGATAT GCTTAArAAC 180 

25 TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 24 0 

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 3 00 

GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 36 0 

30 

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 420 

CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 480 

35 ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 54 0 

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 600 

GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 660 

40 TAG CAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 72 0 

ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780 

ACATGGTTTT CACCAGACCA ATAGCCCCAG AAAC CATGGC GATTCCAATC TATTTTAAAC 840 

45 

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 900 

TGTGACATAA CAGCTAAAAA T TCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 960 

TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020 

50 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 1080 

TTATTTATCA AATGTGCTTC TACTTTC CAA CCTACACCTT TATGTGACGT AGATTGATCA 1140 

£5 



1294 



EP 0 786 519 A2 



(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



w 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 
CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA 

)5 CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaG CTG AAGATCAACA GTATTATAAA 

ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA 
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT 

20 GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT 
ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG 
AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC 

25 GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC 

GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG 
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT 

30 

TATAATG CAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA 
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC 
35 ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA 
AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG 
TAGTTTT 

*0 (2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 0 base pairs 

(B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

so 

TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 60 
GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGt TTAT CATCATGAAA 12 0 
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GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 24 0 

GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATG CAGAG A AATTAATTCA ATATGGAAAT 300 

AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 36 0 

AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 42 0 

CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 4 80 

TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 54 0 

AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 60 0 

TAAATAGCAA TTGTTTAAAG ATAAACGTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 66 0 

GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 720 

ATAAGTAGAA ATAAGATTAT 74 0 

(2) INFORMATION FOR SEQ ID NO: 403: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACG ACTAC C AATATAAGTA 60 

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACG CAG TCATTGCGAT TAATATAATT 12 0 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 1B0 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 24 0 

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 3 00 

GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 3 60 

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 42 0 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 4 80 

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 54 0 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600 

GGTACTATCA AACGTTTTGA AATTTTTTCT 6 30 

(2) INFORMATION FOR SEQ ID NO: 4 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 6254 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 60 

CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120 

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180 

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 240 

AGTTATATAG TTTATAAATA AT CAAATT G A TATTCTATAG GTTCTTATAA CTATAAAGTA 300 

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 3 60 

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420 

CAATATAATT GTTCAACCTA TTTT TA CGTT AATATTAACC TGTTTGACAT TCTTAG GATT 4 80 

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 540 

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600 

ATTCTCAATT GG CGTTAT AT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660 

TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 72 0 

TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 780 

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 84 0 

TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 900 

AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 960 

TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 102 0 

AACAAGAGTA GTTGTGAGGG AAG TAGATT A ATTAAAAGGA GCGATACCAT GTTTAGTTTA 10 80 

AGTTTTATCG TAATAGCAGT TATTATAGTA GTTG CATTAC TTATTTTATT CTCATTTGTA 114 0 

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200 

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260 

CACAAAGCAG GACTAG CATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 1320 

AATGTTGACA GAGTTGTTGA CG CTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 13 80 

TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 144 0 

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 150 0 

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560 
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AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 1680 

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 174 0 

5 ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 180 0 

GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 1860 

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATT AG CT 19 20 

10 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 19 80 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2 04 0 

)5 GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100 

CATATCAGTG ATCATTTCTA TCATTACTAC T ATG CGCGAA AATAGTCATA AAGATAGACA 2160 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 222 0 

20 TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 22 BO 

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2 34 0 

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 24 0 0 

25 ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 24 60 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 25 20 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 25 BO 

30 

GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 26 4 0 

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 2 70 0 

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 276 0 

35 

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 2820 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 28 80 

40 ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2 940 

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3 000 

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 3 06 0 

45 GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 3120 

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 3180 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 324 0 

50 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 33 00 

AGATG TTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 3360 

55 



1298 



EP 0 786 519 A2 



GCAACGTATA TATGTTAATG CCATGAAAAA TAATGATTTA GTATTTGGTA TAGGTCCTGC 34 8 0 

TGGTACAGGT AAGACATTCT TAGCTGTAGT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3 54 0 

5 TGTTAAACGT ATTGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTTTT 3 600 

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCTTTATATG ATGGTCTATA 3 6 60 

TACTGTTCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3 720 

W 

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TTTGTAATTC TTGATGAGGC 3780 

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3 84 0 

15 AATGGTAGTT ACTGGTGACC AAACTCAAAT CGATTTACCT AAAGGTGTTA AAAGTGGACT 3 900 

TAAGGAAGCG GTCAGTAGGT TACACAACGT TAAAGGTATA AGTATATTGA AATTAGATCA 3 9 60 

GAGCGATGTA GTCAgACATC CATTG GTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4 02 0 

20 TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4 0 80 

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 414 0 

GCTTTCTGTT ACATTTGTAG ATAAACAAGA AATACAAGAA ATTAATCGAA CATATAGAGA 42 00 

25 TAArGwTAAr GTTmCaGATG tAaTCyCaTT tGCTTTAGrA GrAGATGAGC CmGaGATkGA 42 6 0 

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 432 0 

GCAAGrACAA GCAAACAATT ACGGACATTC TTTTGAACGA GAATTAGGAT TTTTAGCATT 43 80 

30 

ACATGGATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 444 0 

GTTTGGTCGA CAAGATACAA TATTAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4 5 00 

^ AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC AAAAAGACTA TAAATTTCTT 4 56 0 

TTACATGTGT TTGCAATGAT TGTTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4 620 

ATTGAGTGGA TATTTATACT CATTG CTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4 6 80 

40 ACTGCTATTG AATATGTTGT CGATTTAGTG ACCGTTGAAT ATCATGATTT AGCTAAATAC 4 74 0 

GCTAAAGATA TTGCGG CTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4 80 0 

TTAATAGTAT TTTTACCACA TTTTATAGCG TTATTTTAGG GAGGCATATA TGAGTTATCA 4 860 

45 ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4 92 0 

ATTTAAAGTA GGGGCTTATT TAAAAmCGAA AGACgGTAGA ACTTTTTATG GTACCAATGT 4 980 

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5 040 

50 

TTCTCAAGGA TACAG AC CAG GTGATTTTGA ATCAAtAACT GTAACCGTAG ATGCAGATAA 5100 

AC CGTCATCA CCTTGTGGTG CATGTCGTCA AGTTTTGAAG GAATTATGTG ATGATGATAT 5160 
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ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 5280 

TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 534 0 

TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 54 00 

GACAAGAGAT GACGCGCAAA TT ATATT CAT TGATACGCCA GGTATTCATA AACCTAAACA 54 60 

CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 5520 

CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 558 0 

GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 5640 

AGATGAATTA ATGCCAAAGA TTGAAGAATA T CAAAGTTAT ATGGACTTTA CAGAGATTGT 5700 

ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 5760 

TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 582 0 

TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 5 880 

TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 5 940 

AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 6000 

AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 6060 

AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120 

TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 

AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 6 24 0 

CACGATTTTA AATG 62 54 
(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 6 0 

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120 

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180 

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATT C CAT AAA 24 0 

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 300 
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TGTATTTG TT CAGCTAATGG AAAACGACTA 
GCACCTGTAC TCGCTGTTAA AACTGCAATT 
5 TTCGCTTTAT CAAAACAACC TTCTTCAGTT 

TATAAGTTCC CTATACCATC TAATTG CATT 
CGTTCAAAAA CAGAAACACG TGCTTCTAAT 

10 

TTTAATTTAG AAGCACTACC TTTGAACATC 
ACATTAAAGT ATAAATGaCC ACTTGaATGc 

J5 AATACAGATT GGAGATGTGG ATCTTGATCA 

GAAACACTTA AATAATCTGA CATATAACAT 
TACACCATTT ATAAATTTAT AATGAT CATC 

20 TTCATTCATT ACGACTTTAG CAGGTGTATC 

TAAAATAATA CGATCCGTTT TTAATAAACG 
AATTGTCTCG TCTAATACAG GTTCGTGATC 

25 

TTCGAAGTCT AAATCTGGAT TATCGTCTTT 
ATCACTGTCC TTCATTTCTA ATTGAAATAA 
ACGACTCATT TTTAACTCCT TCAAACGTTT 

30 

GTATGACATT ACTTTTCAAT AACGATTTGT 
ATCGCTGTCA TATTAGAAAT TGAATTAAAA 

35 GAAATATTAA CACCATGTTT TAATGCACAA 

CTCTCGATTT TTAAATCACG GCTTAAATTT 
TTTAATTCAG CAAAATGGCC AGTGATGCCT 

40 ACAGATAGCA CTTCTGGCGC TATTTCTACT 

ACTTTGACCA TGGATTGACC TCCTATTAAC 
AAAAGTTTGT ATTAAATTTA CCGCTTCTAA 

45 

ATGGAATAGT TGTATCAATA CCAAGAACCA 
TAATCGCTTC ATCTCGTGTC GGTTCATGTA 
so ATGGCGGTAT CGTATAATTA GTATAACATG 

GTGCAAGATA TTGCTCmATT TTACCTGGTG 
CATTAATTCT AAATTCAATT GCGTGTCCTG 

55 



TTTATCGTTG AATGGATATC TCGAATTGCG 420 

TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 4 80 

AATTTTTTCT TTAATGCTTC TAATTTTTGA 54 0 

TTATTTACAT AAATTTGATA GTTTCCACGA 600 

AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660 

ATGGCACTTA TAACGCTTTC TTTATCTTTC 720 

TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 780 

AATTTATATT TAATATATTT CGTTAAAGCT 84 0 

CACTCAATTT TATTTTTTTA TATTACTCAA 9 00 

ATCACTGAAT TGTTTTG TTA ATTCAACTGC 9 60 

ACTGTGTAAT ATTT CATATG TTGCCATTCT 1020 

TGCAATAGTC CAATCTTTTA AATAAGGACT 1080 

TTTAACGCCA GAAACTAGCC AATGAATAAA 1140 

AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200 

AGTTTGAAAA GCTTGCACTC GGGATTCTTT 1260 

GTATTTTTCT TTATTTAATT ACTGAATTAG 132 0 

GTAATGTGAA TATTAATTTG CTTAGGTTCT 138 0 

ATTGACGTTT GAATTTTGTT TGCAGTTTTT 144 0 

TATACATCTA TATATATGCC ATCTTCTTTA 1500 

TTACGACTAA CTTTTTCTAA ATTTGTTTCT 156 0 

TCGACTTCCG AAGTAGCTAT ACTTGCAATA 1620 

TTACCTAATT TTGAATTTGA ATAATCAGTT 1680 

CTTCATCATT CATAATGCTA TTTTGCTCTA 1740 

ATATATCGTT ATTCAATAAT TTAATATGGA 1800 

CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860 

TGATTAATTT CGCTACCATC GAAT CAT AAT 1920 

CTGACTCTAT TCGAACACCA TATCCACCTG 1980 

ATGGCATAAA GTTCTTGTAA GGATTTTCAG 2 040 

TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100 
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CAGTTACAGG ATGTTCTACT TGAATACGTG TATT CATTTC CATAAAATAA AATTTATTAT 222 0 

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 228 0 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 234 0 

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGTTCT CCTAAATGAA 24 0 0 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 24 60 

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 2520 

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCGC 2580 

CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 264 0 

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 27 0 0 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 2 760 

ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 282 0 

AACCCGGATG AACGCCATCA CAACCTG TAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 2880 

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 2 94 0 

TATGXAGCGC ATCTTTATCC CCTTCAGAAT AG ATTG CAAC AGTTTGGATG CCTAAATCAC 300 0 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 3120 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 3180 

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 324 0 

TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTG TTT TGTGATTATC 3 3 00 

TGAAGTTGGC TTTGGAG CTT CAGTT TT ATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3 360 

CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 34 20 

TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTTCCG TTAAAGTTGA 3480 

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 3 54 0 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 3 6 00 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 3 66 0 

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3 710 
(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1705 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60 

S 

AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120 

TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180 

io TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 24 0 

AGCTTTTCTA ATT T TTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300 

CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 36 0 

' 5 TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTG C CAT TTTTTTAGTT 420 

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 4 80 

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 54 0 

20 

GATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600 

ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATG CGTTC AACAATTTCA TTTTG TAATG 66 0 

25 GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 72 0 

CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT AC C TGAA G AC GTCGTTTCAT 780 

TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATT CG CGGCATATCT TTTTTACTAA 840 

30 TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900 

TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 96 0 

TACGACACCA TTTTATATCT G T AT AAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020 

35 

CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 108 0 

TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140 

4Q GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200 

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260 

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320 

4S TTTCTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 13 80 

TTTCCGCTAA ATC TTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 144 0 

ATAATTTTAC TTCTTTAAAA TATGT CAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 150 0 

50 AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 156 0 

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620 

55 



1303 



EP 0 786 519 A2 



GCACTATCCA TACTAATTTC AGATA 170 5 

(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

to 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 





TCAATATATC 


TTAAAATTCA 


ATGATTAAAT 


CnATTATCAC 


TAGACATnAA 


ATACATAAAT 


60 




CCTATTCCCC 


ATTTTCATTT 


nTTAATTCAT 


AAATGAATCA 


ATAAC CACCT 


AATAACAAAT 


120 




CATATTATAC 


ACCTTTGTTC 


TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 


180 


20 


TAAAAATAGA 


TGTAATAAAA 


ACGCCTCCTC 


AGATATTTAT 


ATATCTATGA 


AGACGTTTAA 






ATACATTATA 


GATGGTCTGG 


TTCTGGGTGA 


ACGTATACTG 


AGGAAAtACC 


TTTTTTGTGC 


300 




AAATGATGTT 


CGACATTGTC 


ACAAATTTGA TGCGCTTCTA CTAAGGAAAG 


GTTAGCATCT 


360 


25 


ACAACAATTG 


TGACATCAAT 


AAACACACTA 


CTTCCATGGT 


AACGCCCTTT AATACTTTTA 


420 




ACTTCTTGTA 


CTTCATCAAC 


TTCTAAAATA TCATTGCGAT ACG CTTCTAA TTCAGTTTCA 


480 


30 


TTGAAACCAT 


CACTCAACAT 


AAAAATTGCT 


TCTTTAAAAA 


TACCAAAACC 


AGTATAAACG 


540 




ATTAGTAAGC 


CT AGTAATG T 


TGCTAAAATA 


ATATCGACAA 


TTGGGAAACC 


GATTTGCGTA 


600 




AAAATTAATC 
GCCGAATTTA 


CTATCGCTGT 
AAGAACTACT 


TCCAATGCTG 


ACTAAACTAT 


CCGATAAATT 


ATCTTTGGCA 




35 


TTTCGTTCTT 


TTCGCTAGTC 


TTTGATTGAC 


TGCAAATACA 


720 




ATCAACATTA 


CAAGACCACT 


GATTAAGCTG 


ACGATAATTG 


TTATTGCGTT 


AGGTACAACG 


780 




TCATCTTCTT 


TGAACAAACG 


AGGTGCATTT 


TGAATAACTA 


CTTGGATACC 


TACAAACATA 


840 


40 


ATGACAAATG 


ACACCAATAA 


TGAAGAAATA 


TTTTCAGACT 


TCAAATGGCC 


ATAAGGATGA 


900 




TTTCGATCGG 


CAGGTTTAAT 


TGAAATTTTC 


AATCCAATAA 


TAACAGCTAA 


AGAAACGATA 


960 




ATATCTGTCA 


TATTGTTTAA 


TGCATCGGCT 


CTTACAGCTG 


CAGAGTTAAA 


GACAAAACCC 


1020 


45 


GTGACATACT 


TAACAATAGA 


TAAGATTATA 


TATACAATTA AACTCAAATA 


AGCACCGCGT 


1080 




TGCGCCAATT 


TAAGATTTTC 


ATTATGAGAC 


ATGCGTTGAA 


CCAC CTTGAA 


TTAGTATAGT 


1140 


50 


AACAATATTA 


TGAATGATTC 


ATTTTAATTT 


TACAACGTTT 


TTAATTTTTA 


TAAATTTTTA 


1200 




TAAAATTAAA 


CTAATTTATT 


CATTCGCAAC 


CCCTAAAAAT 


AATTTTTAGC 


CTTTCTGCGA 


1260 




ATTTTATGAG 


CTAGAAAGGC 


GCCCAACTCT 


CCCTGTTTGT 


TAACTTTCGC 


CTCGAAAGTT 


1320 
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CGAAwwTTAT GAG CTAG AAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 144 0 

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 15 00 

5 CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560 

CTAATTATAT AGAAGAAAAA GTGCAT CAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620 

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680 

10 

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722 

(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 08: 

GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60 

25 GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120 

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180 

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 240 

30 

TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 3 00 

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 3 60 

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420 

35 

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 4 80 

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 54 0 

40 GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAAC GGTAG A ATAAAAGGCT 600 

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTC TTG ATTTTTCATG GAAATGAGAT 660 

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGT CTCTA 720 

45 ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 7B0 

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 84 0 

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900 

50 

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960 

AAG CTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020 

55 
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AAAAATGAAT 


ACCAGCAACT TCATCAATAA TTGTTGATGC ATAGGACTTA AATATGACAT 


1140 


CTTTCTCCAA AATAT CATTT AAAATACGTG TCATTAGTGG ACCATTGTGC GTCGTTTGTT 


1200 


CTGATAATGT 


ACGAATCTCA CCTGTAATAT GAACGTTTGT CGACAATTTG ATGTGTGGCG 


1260 


ACATAGCTGG 


GTATTTAGGA ACTAATGTTC TGAAAGATAA ACCAGCATAA TAATCCaACG 


1320 


TATGTTTTGC 


TTCAATGATT AATTCTTTAT CTACTTCTGC TTGATAATCA GAATGTAATA 


1380 


CGTCATCTAA 


TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG TTAAACTTTG 


1440 


GCGTAAATTC 


ATTTTCTAAT TGTTTAATTA AGTCCGGAAA TAGTTGATGA ACAGTTGTAT 


1500 


CATAATCTTT 


AGACAGTGAC ATAGTACGGC TTAATTTACT GTGAATCAGT ACTATTTTCA 


1560 


ACTTAATAGG 


TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTGTAATGCA TTTAAACCTT 


1620 


TACGTAATTT 


AGCCCCAGGA TGTAGCGGAT GACCTTCAAT AACGGCTTGC TCTGAACGCA 


1680 


AGTAACTATC 


TTCGCTATTT TCGATAATAT TAAATAAAGG TGCAGAATCA TGTTGCATTG 


1740 


ACAGTGCTTG 


ATAGCTAATT GCAAATGTCA TATtAGTtGC ACTGTTTATT AAATCTTGCT 


1800 


GAAATTGATC 


ACTAGCAGCA TTTTTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 


1860 


TTTCATTTGG 


ATGGTGTACT CGTGTAATCG TATTTGTAAT GTCATCTTTA ATGTAGAAAG 


1920 


GGCCTTCAAC 


ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 


1980 


GCTTAGCTTG 


TGGGAACTGG ATTTCTAGTA TATGAGTCGT TGAGATATCT AACATAATCA 


2040 


AATCACGACT 


CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 


2100 


TTGATGTGAC 


CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 


2160 


TATTAGCCCA 


ATCGCTATTA TGTTGTTGTA AAAATAAATA TGTTTCTTGT TCTTCTTTGT 


2220 


TAAATTTTAA 


TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATGAAA 


2280 


TATTTTACAA 


AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA yTCTTATTAT 


2340 


CAACAGAATG 


CGGGTGTAAG TTTTATGACA AAATATTTTT TTAGCAGTTC TTTTCTACTA 


2400 


TTTCTAGGTA 


ATTGGATTGG ACAAATAGGG CTAAATTGGT TTGTACTTAC CACTTATCAT 


2460 


AACG CAGTTT 


ATCTGGGGAT TGTCAATTTT TGCAGACTTG TACCAATATT ATTACTAAGT 


2520 


GTGTGGGCAG 


GGGCAATTGC CGATAAATAT GATAAAGGGC GATTGCTGAG AATTACAATT 


2580 


TCATCATCAT 


TTTTAGTAAC TG CAATTTT A TGTGTGCTCA CGTATAGTTc ACTGCAATTC 


2640 


CAATTAGCGT 


CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 


2700 


CTTTAAGACA 


AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGTttlw 


2760 


CATTTCATTC 


ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTGTCA 


2820 
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CAGTTTTATT 


ATG CTTACCA 


TTACATTTTA 


AAGTAACTAA 


AATACCTGAA 


GaTGCATCAA 


2940 


GaTACATGCC 


GTTAAAAGTT 


ATTATAGATT 


ACTTCAAATT 


ACATATGGAA 


GGTCGACAAA 


3000 


TATTTATAAC 


ATCATTATTG 


ATTATGGCGA 


CAGGTTTTTC 


ATATACGACA 


CTTTTACCAG 


3060 


TTTTGACAAA 


CAAAGTATTT 


CCGGGGAAAT 


CTGAAATATT 


TGGTATCGCT 


ATGACGATGT 


3120 


GTGCCATTGG 


TGGTATTATT 


GCAACGCTAG 


TTTTACCTAA 


AGTACTTAAA 


TATATTGGTA 


3180 


TGGTAAATAT 


GTATTATTTA 


AGTTCATTTT 


TATTTGGCAT 


TGCTTTGTTA 


GGTGTGGTAT 


3240 


TTCACAATAT 


TGTCATCATG 


TTCATTTGTA 


TTACATTGAT 


TGGGTTATTT 


AGTCAATGGG 


3300 


CACGTACGAC 


AAATCGCGTT 


TATTTTCAAA 


ATAATGTTAA 


AGATTATGAA 


CGTGGTAAAG 


3360 


TACTGAGTAT 


TaTTATGATG 


GgATAGAGGT ATGaTTCCAT 


kGGGAAGTCn 


ATTAATGAGT 


3420 


ATATGTGCAG 


ATGTGTTTGG 


CATTGTTAGA 


ACTTTTTCAA 


TAATGGGAAT 


AAGTACTATA 


3480 


TGCATTACAA 


TGGTATTCTA 


TTTTATAAAT 


AGAAAGTTGA 


AGTTAAAGTT 


GGAGGAAAGT 


3540 


AATCATGGTA TATCTTGAAT 


GGGCAAAGGC 


AGATAGAAAT 


ATTCAATATC 


GTGTAATTAA 


3600 


CGCCATTATT 


AAAGAACGTA 


TTTACCCCGA 


GCAAACATTT 


ATTTCGCAAA 


AAGGATCTTT 


3660 


AATTGAAATT 


CAGTATCATA 


TGCATGTGTT 


GACTATTGAA 


GTTGTTAGAA 


AAAGTGCATT 


3720 


AGAACGCTAT 


GAGTTTACAG 


GTGATATTAC 


TTATTTAAAT 


AAAGGTGAAA 


CGTCATTAAT 


3780 


TATAACTTTA 


GAAGGTTTAT 


TAGATGTGTT 


GAATCATGAC 


TTTGATATCC 


CTATTTCAGA 


3840 


GCGACTACGC 


GAAGAGTTAA 


TACACAGTCG 


AGATAGTTTA 


GTTGAAACAT 


ATAAGCAAAT 


3900 


GTCTCACAGA 


CAAACGTTAA 


TAAGT CrtlAAG 


TTTTAAATTT 


TCAAGGTTAC 


CACAAGATAT 


3960 


TAACTTTTTT 


TCakGGTtAC 


AACATGTAAA 


AGATAGTGAT 


AAGACAGATG 


ATTTAACTTA 


4020 


TTCTGAGAGT 


TTGGTACCAG 


AGGGGCATCC 


AACACACCCT 


TTAACCAAAA 


CGAAATTGCC 


4080 


CTTAACTATG 


GAAGAAGTAC 


GAGCATATGC 


ACCTGAGTTT 


GAAAAAGAAA 


TCCCTTTGCA 


4140 


AATTATGATG 


ATTGAAAAAG 


ACCATGTTGT 


GTGCACAGCT 


ATGGATGGTA 


ATGATCAATT 


4200 


TATTATTGAT 


GAAATAATTC 


CCGAATACTA 


CAATCAGATT 


CGTGTGTTTT 


TAAAGAGTTT 


4260 


AGGTTTGAAA AGTGAAGACT 


ATAGAGCGAT 


TTTAGTACAT 


CCTTGGCAAT 


ATGATCATAC 


4320 


GATAGGGAAA 


TATTTTGAAG 


CATGGnTTGC 


TAAAAAAATA 


TTAATTCCAA 


CGCCGTTTAC 


4380 


AATACTTcCA 


AAAGCaACTT 


aTCatTTaGG 


ACGATGTCTT 


TAATTGATAA 


ACcATACCAT 


4440 


GTTAAgTTGC CCgTCGATGC 


aCAAGCAACA 


AGTG CCGTTA 


GAACAGTCTC 


AACTGTGACT 


4500 


ACTGTAGATG 


GACCAAAGTT 


AAGTTATGCT 


TTACAAAACA 


TGTTGAAT C a 


ATATCCaGGA 


4560 


TTTAAAGTTG 


CTATGGAACC 


GTTCGGTGAA 


TATG CAAATG 


TTGATAAAGA 


TAGGGCACGT 


4620 
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AGTGCAAGTC 


TAGTTAATAA 


AAATCCAATA 


GATCAAAAAG 


TTATCGTGGA 


TAGTTACTTA 


4740 




GAGTGGTTAA 


ATCAAGGAAT 


TACTAAAGAA AGTATTACGA 


CATTTATTGA ACGATACGCT 


4800 


5 


CAAGCATTAA 


TCCCGCCTTT 


AATTG CTTTT 


ATTCAAAATT 


ATGGAATTGC 


TTTAGAAGCA 


4860 




CACATGCAAA 


ATACAGTAGT 


GAACTTGGGG 


CCACATTTTG 


ACaTTCAATT 


TTTAGTGAGA 


4920 


W 


GATTTAGGTG 


GTTCTAGAAT 


TGATTTAGAA ACATTACAAC 


ATCGTGTATC 


AGATATTAAA 


4980 




ATTACAAATG 


ATAGTTTAAT 


AGCTGATTCT 


ATAGATGCAG 


TGATTGCAAA 


ATTCCAACAT 


5040 




GCTGTTATTC 


AAAATCAAAT 


GGCAGAATTA 


ATCCATCATT 


TTAATCAGTA 


TGATTGTGTT 


5100 




GAAGAAACCG 


AATTATTTAA 


CATAGTACAG 


CAAGTAGTAG 


CGCATGCCAT 


TAACCCAACA 


5160 




CTACCACATG 


CAAATGAGTT 


AAAAGATATT 


TTGTTTGGAC 


CAACAATTAC 


TGTCAAAGCG 


5220 




TTGTTAAATA 


TGAGAATGGA 


AAATAAAGTA 


AAGCAATATT 


TAAATATTGA 


GTTAGATAAT 


52B0 




CCGATAAAAA 


AAGAGGTGTA 


GTACTACATG 


GCACACGTTA 


ACATAAATAT 


ATCGAAGATT 






AAaTATAACG 


CCAAAGTACT 


TCAAACAGTT 


TTTCAAAGTA AAAATATGCA ATTCACACCA 


5400 




GTAATTAAGT 


GCATAGCTGG 


TGACCGTACA 


ATTGTAGAAA 


GCTTAAAAGC 


GTTAGGTATC 


5460 


25 


AATCATGTTG 
A 


CAGAATCCAG 


ATTGGATAAC 


ATAATTAGTA 


TTGCAGATAC 


AGGATTTAAC 


5520 
5521 



(2) INFORMATION FOR SEQ ID NO: 409: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 6 0 

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 12 0 

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180 

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 24 0 

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 3 00 

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 36 0 

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 42 0 

GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 4 80 
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CAAAAAATAA GTGGTTTAAA CAAAACTGAT CAAGCAAACT TAAATAAAAT TGAAAATGTG 6 00 

TATAAAATCT TAAATAGTGA TTATTACAAA AAACAGGACT CTGACAAGTT AAGTAAAGCT 660 

GCAATTGATG GCATGGTCAA AGAATTAAAA GATCCTTATT CTGAATATTT AACAAAAGAA 720 

CAAACGAAAT CCTTTAATGA AGGTGTTTCA GGTGATTTTG TAGGTATTGG TGCAGAAATG 780 

CAAAAGAAAA ATGATCAAAT TATGGTTACT AGTCCTATGA AGGGATCTCC AGCAGAACGT 840 

GCTGGCATTC GTCCTAAAGA TGTCATTACT AAAGTAAATG GAAAATCAAT TAAAGGTAAA 900 

GCATTAGATG AAGTTGTCAA AGATGTTCGT GGTAAAGAAA ACACTGAAGT CACTTTAACT 960 

GTTCAACGAG GTAGTGAAGA AAAAGACGTT AAGATTAAAC GTGAAAAAAT TCATGTTAAA 1020 

AGTGTTGAGT ATAAGAAAAA AGGTAAAGTT GGAGTTATTA CTATTAATAA ATTCCaGAAT 10 80 

GATACATCAG GTGAATTGAA AGATGCAGTT CTAAAAGCTC ACAAAGATGG TTTGAAAAAG 114 0 

ATTGTTTTAG ATTTAAGAAA TAATC CAGGT GGACTACTAG ATGAAGCTGT TAAAATGGCA 1200 

AATATTTTTA TCGATAAAGG AAAAACTGTT GTTAAACTAG AnAAAGGTAA AGATACTGAA 1260 

G 1261 
(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

AAATATATTG AAnAGAnAAT TACTAAGATT AAATCnTCTT AAAATATCCC TGAAATAACG 60 

TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA TGTCGTCATG 120 

GACAAAGCGA GTGGAATGCT AAAAACTTAT TTACTGGATG GGAAGATGTT AATTTATCTG 180 

AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT AACATTGCCA 240 

TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT TATATTTTAA 3 00 

CTGAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT GAACG CCACT 360 

ATGGTGGATT GCAAGGCTTA AATAAAGATG ATG CTAGAAA AGAATTTGGA GAAGAACAAG 4 20 

TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA GAACAACGTG 480 

AAGCTTACTT AGCTGATCGT CGATA TAATC ATTTAGATAA ACGTATGATG CCTTATTCTG 54 0 

AAAGTCTGAA AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT ATTTCACAAT 6 00 



1309 



EP 0 786 519 A2 



TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATGAAATT AAAACAGGTG 72 0 

CACCGCTTGT TTATGAATTA ACGGATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 780 

AaAAG AG CTG CATGTACACA AGGAGTGAGT GTATATGcAG CTCTTAAAtA TGTGAAGTAA 840 

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGGATACT CGAAAATATA 900 

GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCTTTTATG TTGCTTAATA 960 

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 102 0 

TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATGACCGA TAACTACTCA 1080 

TATGTAAATA GTAATGATTA CGTTTTAAAG AAATTGTAAT AAAGTCGTGC TAATTTTTTG 1140 

GAAAATGGGT ATAATTACCG GATATCTAAA AATGTGTGTC GTTTTTTAGA TGGTGAGGGG 120 0 

GAAGCTTTAA ATGTCGAAGA AACAAAAATT AACGATGATT ATTACTATG C TGATGGGTGG 12 6 0 

ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 1320 

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCT TTTTT AT TGACTAATGG 13 8 0 

GATTGTTATT CCTTTGTCCG CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTTTT 144 0 

AGTGGG TATT TCTATCTTTT TCTTAGGTAC ATTACTCGGC GGCTTGAGTC CGCACTTTGC 1500 

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATGA TGCCATTGAT 1560 

GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TGGGGATATT 1620 

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGG AC CTACT CTTTCAGGTT ACCTTGTTGA 168 0 

ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCGCTCCA ATTGCAGCTG TGACATTTTT 1740 

AATTGGaTTT AAAAcGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 1800 

TTCTGTCATT TTTTCTGTAC TAGGTTTCGG cGGGTTATTG tATGGAACGA GTTCaATTTc 1860 

AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCGT TGTTTTAGTC 1920 

GCATTATTTG TAwTACGTCA ATATCGG CTA TCAACACCAT TATTaAATTT TGCTGTATTT 1980 

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 204 0 

GGTTCGGAAA CGATTTTACC TATCTTTGTG CAAAATTTAT TGCATCGTTC AGCTTTAGAT 2100 

TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATGTCGAT GACTTCGGGT 2160 

GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 2 22 0 

ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 2 280 

GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 234 0 

ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATGACAAA CACAGTACAA 24 00 
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AACTTTTCTC CAACTATGTC AGACTATA 
(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1105 base pai 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60 

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTC CAGTT GTATTTTCAG ATGAAAGTCC 12 0 

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180 

AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 24 0 

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300 

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 3 60 

AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 42 0 

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480 

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540 

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600 

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GAT CTTTAG C ACCTTGATAA 660 

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720 

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780 

G CTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840 

GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900 

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960 

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020 

TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080 

TGCAATGTAA CATGTCGATA ACCGT 1105 

(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 57 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60 

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120 

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180 

ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGG ATTG C TATGATATGC ATAATTAATA 24 0 

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 3 00 

TTTTGACGTT TAGAACCTTG AG CTT CTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360 

AATTGATCTT CAC CTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 4 20 

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 4 80 

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGAC CTn C TACAACAAAA 54 0 

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579 



(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS : 
25 (A) LENGTH: 1342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413: 
GTGGTGAAAC TTCTTGCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60 

35 CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CG TAATATTT GCTAATTcTA 120 

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmm CtTGATACTy CTCTTGATAT 180 
TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 24 0 

40 TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300 
AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 36 0 

GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 42 0 

45 

AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480 
TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 54 0 

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 60 0 

so 

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660 
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TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 78 0 

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAAT CCATTG CAATACCTGT 84 0 

TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 9 00 

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 96 0 

ATACGTATTA GAAGTATAGC GTTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 102 0 

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTGAA TAACGATATC 1080 

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 114 0 

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 12 00 

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 126 0 

GAGGTTTTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320 

ATACCCATTA TACCAATCAT TT 1342 
(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

CTG TATAAAG ATGGAGGTGT TTTGCATGGT AAAACGTACT TATCAACCAA ATAAA CGTAA 6 0 

ACATAGTAAA GTTCATGGTT TCAGAAAACG CATGAGCACA AAAAATGG CC GTAAAGTTTT 12 0 

AGCGCGCCGT CGTCGTAAAG GCCGTAAAGT TTTATCTGCA TAAGATCACT GACTCATCAG 18 0 

TGAtCTkTTT TTTCGTTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 24 0 

AGTTTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300 

TTACCGAATT AAAAAGAATG CAGATTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 36 0 

CAACAGACAA TTTGTTGTAT ACACTTGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 4 20 

TATTAGTGTT TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 4 80 

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 54 0 

AAGACAGCCA GCTAAAGATA TGACGACTTT ACAAATACAG AATAGT CTTG AGCACGTACT 60 0 

TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 66 0 

ATTAACCACT CAACACATCC CGAAGTCTTA CCTCAGACAA ACGTAAGACT GAC CTTAGGG 72 0 
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TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAATTG GAATTGTTCG 84 0 

ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 900 

5 AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 96 0 

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 10 20 

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073 

W 

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 
JS (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY -. linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 







GCCAAATCCG 


AAGTAAAGTA 


TAGCAATAAA 


GATTACTAAT 


ACAATTCTGT 


60 




AAATGGCAAA 


TGGAATTAG T 


TTGATTTTGT 


TAATTAGATG 


CAAGAATGTT 


TTGATTGCAA 


120 




TTAGTCCAAC 


AGTAAATGCA 


GCTAAAAAGC 


CTAAAATATA 


AAAAGGTATA 


TCAGCAATCT 


180 




GAATATCTTG 


ATAATGTTTT 


AATAAAGATA 


AACCACTAGC 


TGCTAACATA 


ATTGGAACAG 


240 




CCATAATAAA 


TGTAAAGTCC 


GATGCTGCTT 


TATGATTTAA 


TTTCATTAAT 


ACCCCAGTTG 


300 


30 


AAATTGTTGA 


GCCTGAACGG 


CTGAAAC CAG 


GCCACATAGC 


TACTGCTTGA 


gAAATACCAA 


360 




TTACAAATGC 


TTGGAAATAA 


CTGATTTGAT 


CTACTGTTTG 


TGGGTTTTTA 


ACTTTAGCTG 


420 


35 


AGTATTTATC 


AGCAATAATC 


ATATAGATAG 


CACCTACGAA 


TAAGCCAATC 


ATAACAGTTG 


480 




GCACACTAAA 


TAAATGTTCT 


TCGATGAAAT 


CATCAAATAG 


TAAGCCTAAA 


ATACCTGCTG 


540 




GCACCATACC 


CACTAATACA 


TGTAATAAAT 


TTAAACGTCT 


TGGCTTTGAA 


CGTCTTTGTT 


600 


40 


GATCGTTATC 


TCCTTCAACA 


TGTTTGTGTT 


TACCAATATG 


TAAAATCTCT 


AAGAAGCGTT 


660 




CGCGGAACAC 


CCATGCTGCT 


GCAAAGACGG 


AT CCTAATTG 


GATGACGATT 


TTAAATGTAA 


720 




ATGCTGACTG 


AGAACCTAAA 


AATTCAGATG 


ATTTTAACCA 


CATATCATCA 


ACTAGGATCA 


780 


45 


TATGTCCAGT 


AGAGGAAACA 


GGTGCAAATT 


CTGTTAATCC 


TTCGACGACC 


CCTAAGATAA 


840 




TACCTTTTaT 


TAATTCAATG 


ATAAACATAA 


TGTACCCACT 


TTCATTACT C 


AATTTAATTT 


900 




ATTTAAATAT 


CAAAATTACC 


ATATCATGAT 


AGCATATTCA 


TTTAAAGACA 


TGCTAGTTAT 


960 


50 


AGTTATAATA 


CTAGACTAAA 


GATGTATATA 


TTCATTTTCT 


TTTACATGTA 


AAACTACAAT 


1020 




ATTTTATTGA 


GCTATTTAAT 


TTGATTTTAA 


GGAAAACCTT 


TTATAATAGG 


TTTAGGTGAT 


1080 
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TTCTTGGTCA 


GTACTGGTCT 


CGGCATAATC 


GTTATAACGC 


AAAATATTTT 


AATAGCAGAT 






TTTTTAGCTA 


AAATTATAAG 


ACATCAATTT 


CAAGGTTTAT 


GGATTGTATT 


ATTTATTTTA 




5 


TTAGGTGTTT 


TACTTTTAAG 


AGCAACTGTG 


CAATTTCTAA 


ATCAATGGTT 


AGGTGATACA 






TTAGCATTTA 


AAGTTAAGCA 


TATGCTTAGA 


CAGCGGGTTA 


TTTATAAAAA 


TAATGGTCAT 






CCAATCGGTG 


AACAAATGAC 


TATACTCACA 


GAAAACATTG 


ATGGTCTAGC 


AC CTTTTTAT 




10 


AAGAGTTATT 


TGCCTCAAGT 


GTTCAAATCA 


ATGATGGTTC 


CGCTCATCAT 


AAT CATTGCA 






ATGTTTTTCA 


TCCATTTCAA 


TACCGCATTA ATTATGTTAA TAACTGCACC 


ATTTATT C CT 




15 


TTGTTTTATA 


TTATTTTCGG 


TTTGAAAACG 


CGAGATGAGT 


CAAAAGATCA 


AATGACTTAT 






TTGAATCAAT 


TTAGTCAACG 


GTTTTTAAAT 


ATTGCTAAAG 


GTTTAGTGAC 


GTTAAAGCTA 






TTTAATCGTA 


CAGAGCAAAC 


AGAGAAGCaT 


ATTTa CGACG 


ATAGTACTCa 


GTTTAGAACT 






TTAACAATGC 


GCATTTTa CG 


CAGTGCTTTT 


TTATCGGGAT 


TAATGCTCGA 


ATTTATAAGT 






ATGTTAGGTA 


TTGGATTGGT 


TGCATTGGAA 


GCAACGCTAA 


GCTTAGTAGT 


ATTTCATAAT 






ATT G ATTTTA 


AAACTGCGGC 


AATTGCGATT 


ATTTT AG CG C 


CTGAA TTTTA 


TAATGCAATT 




25 


AAGGACTTAG 


GGCAAGCGTT 


CCATACTGGA 


AAACAAAGTG 


AAGGTG CCAG 


TGACGTTGTG 






TTTGAGTTTT 


TAGAACAACC 


GAACTATAAT 


AATGAATTTC 


TATTAAAGTA 


TGAGGAAAAC 






CAAAAGCCAT 


TTATTCAGTT 


AA CAGACAT A 


TCATTTCGAT 


ATGATGATTC 


TGATAGATTG 




30 


GTATTAAATG 


ATTTAAATTT 


GGAAATATTT 


AAAGGTGATC 


AAATTGCACT 


TGTAGGTCCA 






AGCGGGGCAG 


GTAAATCCAC 


TTTGACACAT 


CTTATTGCAG 


GTGTTTATCA 


GCCAACAATA 






GGTACTATAA 


GTACAAACCA 


GCGTGATTTA 


AATATAGGAA 


TACTTAGTCA 


ACAGCCATAT 




35 


ATTTTCAGTG 


CTTCTATAAA AGAGAATATT ACGATGTTTA AAGATATAGA 


AAATAATACT 






ATTGAAGAAG 


TGCTAGACGA 


AGTAGGTTTA 


TTAGACAAAG 


TGCAATCTTT 


CACAAAAGGC 




40 


ATTAACACAA 


TAATAGGTGA 


AGGAGGCGAA ATGTTATCTG 


GTGGACAGAT 


GAGACGCATA 




GAACTTTGCC 


GTCTTTTAGT 


TATGAAGCCA 


GATCTCGTTA 


TATTTGATGA 


GCCTGCAACT 






GGTTTAGATA 


TTCAAACAGA 


ACACATGATT 


CAGAACGTTC 


TGTTTCAACA 


TTTTAAAGAT 




45 


ACAACGATGA 


TTGTCATTGC 


ACATAGAGAT 


AATACAATTC 


G CCATTTACA 


ACGACGCTTG 


2640 




TATATAGAAA 


ATGGAAGACT 


GATTG CTG AT 


GATCGCAATA 


TTTCAGTAAA 


TATAACAGAA 


2700 




AATGGTGATG 


ACTTATGAAA 


ACACGACTAA 


AATTTCAAGT 


AGATAAGGAT 


TTATTGTTAG 


2760 


50 


CTATAGTTGT 


TGGTGTTTGT 


GGAAGTTTAG 


TTGCGCTCGC 


CATGTTTTTC 


TTAAGTGGTT 


2820 




ATATGGTGAC 


ACAAAGTGCA 


CTTGGTGCGC 


CACTATACGC 


TCTGATGATT 


TTAGTCGTTA 


2880 
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ATAAAGCTAC 


ATTTACAATG 


CTACGTGATA TTCGGGTACA 


GTTTTTCGGT 






ATGTCATTCC 


TAATGTTTAC 


CGTAAACTGA GTTCTAGTGA 


TTTAATTTCA 


CGTATGATTA 


3060 


GTCGTGTTGA 


GGCATTACAA 


AATATAkATT TACGTGTTTA 


TTATCCACCA 


GTCGTCATCG 


3120 


GTTTGACAGC 


GCTAGTTACA 


GTCATAGTTT TGGCGTTCAT 


TTCAATCGGC 


CATGCG 


3176 


(2) INFORMATION FOR SEQ ID NO: 416: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41G: 






TCTTTATTTC 


AATTTCCAAT 


TGAATTTTTT TATTATTTAC 


GCATAGCTCT 


TAAAATTAAC 




GTTACGATTG 


CAATTAAGAT 


AATTGAACCA ATTAATGCTG 


GCAAGATGTA 


AATACt TCCT 




AATTCAGGAC 


CCCATTGTCC 


GAATAGTTTG CCACCTACCC 


ATGATCCAAT 


AATACCTGCG 




ATAATATTGC 


CTAAAATACC 


ACCTGGGATA TCTTTACCCA 


TAATAGCACC 


AGCAGCCCAT 




CCAATTAAGC 


CACCGACAAT 


TAACATTCCA ATAAATCCAA 


ACATAATTTT 


CAGTCTCCTT 




TTTCTATTTA 


TTTTGCGTTA 


TTCTAAGTAG TACCCCTTAT 


TTACAATTCT 


AAAACAAATT 




CAAATTATTT 


TTATCCAAAT 


ATTTTTAAAA G TAGTAATTG 


AATATCAATT 


TTATTCAATG 




TAGCTATCGT 


TATTTAAAGT 


CTCTGTACCG ATAATATCAT 


ATACATTTAC 


ATTATTTTTT 




CTGCCGAATT 


CATAGCTTGA 


TTATTTTATG TTATAGGACT 


AgAATATACA 


CATATTATTA 




GAGCATCTTT 


GAATTTTAAA 


TCAAGAAGCG AGGTTAATGA 


ACAATGAATA 


TG CAT AT TTT 




ATATaACTTA 


CGAACTAAAC 


ATAATTTAGA AATTGACGAA 


TTAGCACAGC 


AATTAAATGA 




GAAATATGGT 


ACTAAATATG 


AAGCACATCA AATTTGGGAA 


TGGGAGAATC 


ATCACCATGA 




ACCTAAATTT 


AAAGATGCCA 


TGCATTTAGC TGACTtCTTT 


GATGCACCAT 


ATGAAATGTT 


780 


TTTAGAAAGT 


AAGGTTAAAG 


AATATCAGAA ACATTTAGAA 


GAAGTCGATA 






TAAATAGATG 


CAAATAAACC 


CTCACAACAC GTTTGGCATA 


TATCCTTTCA 


AATCTATACT 


900 


GGATATATTA 


CATTACGTTG 


TGAGGGTATT TTATTAATTA 


ATATGAATTA 


AGACATTTTA 


960 


CAAGCGTTAA 


TGCAACGAAT 


CTTTTTAGTG ATCTTGCTCA 


CT CTTTAAT A 


CTTTACCGTT 


1020 


CTTAGCATCA 


ACAGTAACTT 


CTTGTTTTTT ATTACCTTTT 


TTCAAATCGA 


TATTGTAAAC 


1080 


AAGTTTGCCA 


TCATCTTTTT 


CAAGTGACCA TTCTTTAATA 


TCACCATCAA 


ATTCTTTTTG 


1140 
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ATTCATTGTA TCTTCTTTTT CAGTCTTTTT GTTAATCACT TTTTTATTTT TATCAGCAAC 1260 

AAGTACTTCT GACT CTTCAC GAGATTTTTG TTGCGTCACT TTATAAGCCC ATTCACCATT 13 20 

AGAATTTTCA AATGAAATTC CTTTCAACTT TTGGCCTTTG TAAGTTTCTT CAGCTTTTTT 13 80 

CACAGCATCT TCTGGGCTTG TTTTAACATC TTTTAAAGCA ATAACATCTT TTGTTGTATT 144 0 

AGTGTCTTGA TTAGTATTTG ACTCTGTTGA TTTTGTTTCA TCTTTTGGAG TATCATTGCC 1500 

ACATGCAGTA AGCACCACCG CTGACATTGA TAACACTGCT AATGATTTTA ATTTCATAAT 1560 

ATCACTCTCT CTTCCTATTT TTGAAACTCA TAACAAAAGC TTATATGCTA TATAGATTGT 1620 

ATTACCCCTT GTTTTTAATT TTATTCATAA TTATTACAAA TATTTTTAAA TTAATCGTCA 16 8 0 

TGCGTTACTT TCGTTCGTAC TCCTTTTATA AATGAACCAT GTAATATAAG CATGCTATTA 174 0 

TCGACTCAGT TTGTCTAATG CTTTCTTTGG TACTTCTTCC TTTTCAACTT CTTCAAAAGT 1800 

TTCTACATGA TGACCTTTAT GTGTAATTTT CAAGTATCTA TGCGGTTTAA CATCAAATGT 1860 

AGCAGTATAC ATTAATTGCG TCTCTTTCCC TTCTTTATTA AAAACACTTT GACTATAACT 1920 

TCGGAGTTGA TCATCCATTC TAGTAGACAC CTCTGTCGTT TTAACATAAG AATCATCCTT 19 80 

CTTTACTAAT GGGTTAAATT GATCTGTTAT ACCATGGGAG TCTATTGTTT TTAAAATGAA 2040 

TAAAGCAg CA TAAGCGCCAA TGATAG TAAG TACAAGATAt GTkATTGTTT TTAAAATCGT 2100 

TTCAAAAGA 2109 
(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 6 0 

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120 

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCA CT AAA 180 

GTTAG GAGTT ACTTTAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 24 0 

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 3 00 

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 3 60 

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 42 0 
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ATCGCCTGAA CATAATTATC AATTTGGTGG CGCTATGATA AAAAGTGAAG GAGTAGATAA 54 0 

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600 

5 

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 6 60 

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720 

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAwT AaTAGTTTGG 78 0 

CCATTATAAG AACAATATAT AT CGAnTAAC AAT 813 
(2) INFORMATION FOR SEQ ID NO: 418: 

;5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 64 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 
AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 60 
25 TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 12 0 

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 180 
AGTCGG CAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 24 0 

30 

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 300 
GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360 
ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 42 0 

35 

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 48 0 

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 54 0 

40 TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 60 0 

GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 640 
(2) INFORMATION FOR SEQ ID NO: 419: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 
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TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120 

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 18 0 

5 CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 24 0 

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300 

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360 

10 

TAAAACTGCA ATTGTAGGGA GATTGTATTT T AAAG CTATT TGATGTGCCA TTGCATCAGC 420 

TCCTTGAGCA AGG CCGGAAA CAATTGTTAA ATATTTGCTT TTAT CATTTG ATAATAAAAA 480 

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 54 0 

75 

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 6 00 

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660 

2Q CTG CAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACGTAA 720 

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 78 0 

CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 84 0 

25 GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACTC CTTTTTGTAA 900 

TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960 

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 102 0 

30 TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080 

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 114 0 

TAAAACCCTT TTTAAATTG 1159 

35 (2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1879 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 6 0 

CCCCACTAAT ATTAT CATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 12 0 

TGGCGGTATT CGTACAA CAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 18 0 

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 240 
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ATCAGCTACT GAAAATGGTA AGCTTACATT TTTACAAGTA TTTTTTGAAG TCATGTCTGC 3G0 

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CACAAGTGAT ATTAGTGATA TTTCTAAGGT 4 20 

CGTACTAATG ATACTCATGT TTATAGGACG TGTTGGCTTA ATATCATTTA TCATTATGAT 480 

AGCAGGACGT CGAGAACCAG ATAAATT CCA TTAT CCAAAA GAACGTATTC AAATAGGATA 54 0 

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TGGAACTTAG ATTGCTTTTT 60 0 

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CGATTGGTTT CGAAAATGGT AAACTAGTAA 6 60 

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA TTTTAACTAC 72 0 

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TCAAATATTT ATAGAGCTGG 78 0 

TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCGG 84 0 

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 90 0 

TCCAATGATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 96 0 

ATTCAAGTTT GGTTTATCAT TTTTAACTCG TTCAATTGCT TTGGCACGTT TTCCATGGTT 1020 

ATCAGCAAAT ATTGAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 108 0 

TAAACATTTT GGTGACTTAA AAATTG CTAT CGTAGGCGTC ACAGCAGATG GTTTAATGGA 114 0 

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 12 00 

ACGTTGGATT AGATATATCC ATGAAGTTGA AGAGCCAGAT TTTTTGATTG TAATTTATCA 12 6 0 

TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC 132 0 

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATG ATTACAGCTC ATCAGCATCA 138 0 

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG GCTGGTCAAG ATGCCAAAGA 144 0 

G CTTGTA CAT CTTTCGATTA ATTTTAAAAA GAGAACAACA ACTTATGATG TTGAAAG CAT 150 0 

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 156 0 

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 162 0 

G ATGTT AT CA GTAAATGGGT TACAAGATTT AG TCTGTCAA ACACATCCAT TTTCGCAATT 16 8 0 

ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 174 0 

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATGAA GrTTTGTATC aTGCATACCC 180 0 

ATaTCCAGAT aAGCCACGGG CATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 186 0 

ATAGTTATTC ACCATTAGG 1879 

(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60 

TnAAATTGGA nAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 12 0 

10 

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180 

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 24 0 

r5 TGTCAAT CAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 3 00 

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTAC CAG 3 60 

TGCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420 

20 CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 480 

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 54 0 

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 60 0 

25 ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660 

TTCCACTTGG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 7 20 

CTAAATTATT ATAATGTTTA CTTATTG CTG ATACTGAACC TACCGCAATA GAATGATCTC 780 

30 

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 84 0 

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 9 00 

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960 

35 

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020 

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTG CTTTT GTCTTTGTCA 1080 

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140 

40 

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200 

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 12 6 0 

45 ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCG C CTATGT GTAATGATAC 132 0 

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAAC CAAG AAATTGACGT 138 0 

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 14 4 0 

50 CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 150 0 

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 156 0 

55 
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1680 



GTGTTTTATC TCAAGATGTG TTACTCAAAA AG TT AT AG AA GAAGCTACTA AAGTTAAAAC 

AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 174 0 

TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 18 00 

GGATGAATTT AAAAGCGTTA AAAATGTAAC TGGATCAATG AGTTTCATTT TTATAGATAA 1860 

TGATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 1920 

TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 1980 

TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 2 04 0 

ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 2100 

TGAATACCGT AATAAAAAAG GACCTGATTA TACAATTTTT AAGAATAACT GGAAAGTCCT 2160 

ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATACAGA TGGAATAAAT CTTTTAAGGC 22 2 0 

TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 22 8 0 

CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 234 0 

TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 24 0 0 

T AAAT ATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 24 60 

TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 252 0 

TTACAGAAAT TT CAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 258 0 

AGaAAAAAAG AAGACAAC CA AGC CCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 264 0 

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CnCCACATTT GGGGGAGGAC 2700 

CTAAAAAAGC 2710 
(2) INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE : nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC Ct GTGTTTTT 60 

AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 120 

GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180 

GCCCTCACAA GAAGAT AT CA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 24 0 
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GGGTTACTTT GATGAAAATG ATTGAAGTTG ATAGAATTAA TGACGTATCT TGTGTATAAT 360 

ACCTACGAAG TACTTTCATT GGAGGAAAAA TAGTGACTCT ATTTATTATT AT CGGGGTTC 4 20 

TCGTGCCAAT GGTTTATACC ATGCAGTTAA ATATTAAAAA TGAACCTGTa ACAAAGCGCA 4 80 

ATCTTTTAAT AACATTAGCT TTATCTACGT TAGGTATTTT AGTAACCGCG TTAGCAGGTG 54 0 

TAAT CGTTAC GAAACAAGCT TTTC CTTTAT TAAGTGTAGC AATTGGCTCA ATTTTTACTG 60 0 

GAATCGTTTG GGGCCTTTTA CTAAGTGGTA GcTACGCGCT GATACGATTT TTATCTAACG 66 0 

CATTTGGGCG TAAGTAATAG TAATCTGTTC ATCAAGTAGT ATCCGTGCTT GAAAACAAAC 720 

TAAAACTCCT AATGTGGAAC TAGTTAAAAA ATTCTAAACT AGTACATTAG GAGTTTTGTT 780 

ATG CAGAATA AATTTAATTG TTAAATTGAA AGTGCGGTAA AAATCCACTA TTTATTTGAA 84 0 

AAAAATCGAG CAACCAAATT AAAC CTTGAT ATCCTAAGTA AATACATAAT AAAACAAGTC 90 0 

CGACATAAAT TAAAAATCGC AAAATAGAAA GTCCAACTCT AAAAAGGAGG ATGACTAATA 960 

GTGCTATTAA AATAATTGTt AGTATACTCA ATGCTACAAA CCTCCTArTA CGCTTTTAAA 1020 

TCCATAA 102 7 
(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60 

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120 

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180 

GAAAAAAAGG CCCAAAAAGA TAAAAACACA AGTAGAATAT ATGCGAAATT TGGTAAGGAG 24 0 

ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 3 00 

GTGCTTGAAC GCGCTAAGAC AT ATT CAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360 

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 4 20 

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 480 

GTGCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 54 0 

ATGTTGTCAT GTGGCCACAT TTGGTATTG 56 9 



1323 



EP 0 786 519 A2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
s (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

10 TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 6 0 

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 12 0 

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 18 0 

15 TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 24 0 

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300 

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 36 0 

20 

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 42 0 

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 48 0 

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAA GTAAAAG CAG ATGTTCAAGA 54 0 

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600 

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AG C CAGGTGA 660 

3o AACAC GTTT A CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 72 0 

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 78 0 

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 84 0 

35 ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900 

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTG CAGTTT GGTCACAAGC 96 0 

AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 102 0 

40 AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAG CTAAAGG 108 0 

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 114 0 

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 12 00 

45 CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 12 60 

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 13 20 

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 13 8 0 

50 

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 14 4 0 

55 
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2040 
2100 
2160 



TATGACTCGT GAAAATTTAG TGACATGAAT AAAAATGTTG AGGCGTCATT GAGTATAAAG 1560 

GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACmA CGAGGAATAA CTATAGGGaG 1620 

ATGGTTTTGG AATGACGAt G TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 16 80 

CTTACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740 

AACGTTAACA GGGTCTGATG GCGAAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 18 00 

GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 18 60 

AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 192 0 

CAAATTAAAA ATGAAATGTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 198 0 
TTTAAACATA TGCATGTACG AATGATG CCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 
GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 
TTTTTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCTG GTTTTCATAT ACATTTTGCG 

GATGATGAAA GAGCTTATGG TGGACATGTT CTTGACTTTG AAGTGGATGA CGTTGTCGTT 222 0 

GAGATACAAA ACTTTGAAAC ATTCCAACAA CATTTCCCGG TAAATAACGA GACGTTTGTT 22 8 0 

AAAGCGAAAA TAGACTATAA AGATGTGGCA GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 234 0 

ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 2400 

TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 246 0 

TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 2520 

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 258 0 

CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 264 0 

ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700 

TTTCAGGATT ATGCATGATT GCAACGGTCT GTCTACCGAG TGCTGTGATA GAATTAATGC 2760 

CCAAGTTAAC TAACAATTTA TACCAAATAG CTTGTTGAAT ATTTGCTTCT AATACAATGT 2 82 0 

CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880 

GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 294 0 
CAACTGCTTG GCATACGTTT TTAAATGAAA TATGTTCGAG TTGACCATAA CCATTTTGGG 
CTAAAATGAT GAGCGTGTCC TCATGTGCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 

GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 3120 

CTTTCACAAC AATATCTTGT GCAGGTGCAT GTGGTACAGT ATAATATGTT ATTGTTTTGG 318 0 

CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3 24 0 
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(2) INFORMATION FOR SEQ ID NO: 425: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 425: 



AGAACGAACC 


TTTAATACGT 


ACGTGTCGAC 


CACGAACTGT CATGACAAAT TGATAAAACG 




AATAGTTTTT 


CATTAGTTCA 


TTGTCACATC 


AATCACTTTT GTkTCACCTT TAATCACAGC 




ATTTTCATCA 


TAAATATTAA 


TTGAAGCTGC 


TTGATCAGTG TTAGTAATTA TAATTGGTGA 




AATTACAGAT 


TTAGCGTTAT 


TATTAATATA 






GACGTTAACT 


TCGTCACCAC 


TAGACACTAA 


CACTTCAAAT CCTTCACCGT CTAATTGAAC 


300 


TGTGTCTAAA 


CCGATATGAA 


CTAATAATTC 


TAATCCGTTA T CTGCTTTT A ACCCAATCGC 


360 


ATGCTTAGTT 


GGAAAGACAT 


TGTCAACACG 


TCCTGCAATT GGAGACACAA CTTCTCCTTC 


420 


AGTTGGATTA 


ATACCAAAAC 


CTTCGCCCAT 


CATTTTTTGT GCGAATACAG GAT CTGGAAT 


480 


ATCTTCAATT 


TTCACGAATT 


CTCCAGTTAA 


TGGTGCATAA ATTGCGATAT CTTTCTGAAC 


540 


TTCTTTGCCT 


TTTCCGAATA 


ATTTTTTAAA 


CATACTTTCC ACTCCTACTT ATCAAAATGT 


600 


GATATTAAAT 


CGCCATAACC 


CAATTCTTCT 


AACTTTTCAT ATGGAATAAA TTGAATTGCA 


660 


GCGGAATTGA 


TACAGTATCT 


TAAGCCGCCA 


CTTTCTTTAG GTCCATCATT AAAGACATGT 


720 


CCTAAATGAC 


TATTTGATTC 


TTCTGAACGC 


ACTTCAGTTC TCAACATACC AAATGATTTG 


780 


TCGACTAATT 


CTATAA 






796 



(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATT AG CTT 60 

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 12 0 

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATACG 180 



ss 
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ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTAGT ACTAGTACTG CAAAAATAAA 3 00 

TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360 

5 ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 42 0 

TTTAAATCAT ATATGAATTT AGAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 480 

CATTtAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 54 0 

'° TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600 

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660 

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 72 0 

15 

CCAAGCCCAA CCTGAACCAA AGCGTG CAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 780 

TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTT CTGAGTT 84 0 

TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 900 

20 

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 960 

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 1020 

25 TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 108 0 

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 114 0 

TAAATAATAA CAAGCAGTGA CTGGTT CAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 1200 

30 ATTTTCTCTA CTCATTTAAT AT AACT CAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260 

AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAG TAT GAAATGAACC TTTATCTTCT 132 0 

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 13 8 0 

35 TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTG TGAAT A 144 0 

TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 15 00 

TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560 

40 TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATT CCTA AATCTTTAAA TAAGTGCAGG 1620 

TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 1680 

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTAC GTTTA 174 0 

45 

T CTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 1800 

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860 

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAG CTAGG AC 1920 

50 

TAAGCCTAAT ATTACTGATA ATTCAC CTAA AAATACACTT AGTAATATCA ATTGCTTAAA 198 0 

55 
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TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 2100 

TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATATTCC TCATCAAATG ACAATATAAA 2160 

5 TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 222 0 

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 2280 

ATTGAACCCA TCAGCTAATG AAATGAAGAT TGCACTCAAG GCGATACCAG CACTCATTAT 2 34 0 

'° AATTGGAATA GCAATTTCTT GGTAAG CAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 24 00 

CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 24 6 0 

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2 520 

15 

ATCAG CTwTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 258 0 

ACCTAt CAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 264 0 

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 27 0 0 

20 

TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 2 760 

ACTTCATCAA GTGATTTAAA CT CATCAGTT GTAC CATGGA AATGCAAATG CTTATTTAAA 2 82 0 

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 28 8 0 

25 

GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 2 94 0 

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTG cAG GTTCTGAAAT CAATGCTCGA 3000 

30 GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 3 06 0 

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 3120 

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 318 0 

35 GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTGATGTT 324 0 

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 3 300 

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 3 36 0 

40 ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 3 3 93 
(2) INFORMATION FOR SEQ ID NO: 427: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNFSS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 



55 
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CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCT TTTT GATTTCAGAT 12 0 

TGTAAAGCAC TGTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180 

5 CCTCGCGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 24 0 

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG TTAATTGTAC AACCACTTTT 3 00 

CTCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360 

ACTTGTTCGT CATATGGTTG GTACACTTTT TCTACAGCCA TTATAAATTG CTCCTTATTT 42 0 

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 4 80 

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 54 0 

15 

TATAACTGGT AATCCTCTTG GTACCTCTGG ACCTTTTTCA TATTCTTGGA ACACCTGTTT 6 00 

ACATAAACGA TCTTCTAAAG AATGGAAAGT GATTACCGAA ATCCTGCCAT CTACTTTCAC 660 

2Q TAATTCAATC GCTTGTTCTA TTGAATCTT C AAAAGCTGAC AATTCATCGT TTACTGCAAT 720 

TCGTAGTGCT TGAAATACTC GTTTTGCAGG ATGTCCGCCT TTTCTTCTTG CTTTTGCAGG 780 

AATACCTTCT TTTATAATGT CAACTAATTC TAATGTTGTT GTTATTGGTT GTTGTTCGCG 840 

2S ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AG CG ATAAAA 900 

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960 

TGTTTGGTCC ATACGCATGT CTAATGTTGC GTCATGGTGA TAACTGAATC CTCGTTCTGG 1020 

30 AATGTCGAGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080 

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGGAAGTTG CTA 1123 
(2) INFORMATION FOR SEQ ID NO: 4 28: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 838 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 

TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60 

AC GTATT AT A GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 12 0 

ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180 

AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240 

50 

CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 300 
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 420 

GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 4 80 

TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 54 0 

AGTTCATAAT TTCGACTATA AGTGG CT ATT AGCCATTATT TTTCGGGATC TATGTCAAAT 600 

CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 66 0 

TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 720 

TGTTTCTTGA ACTATATATC AT CTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 780 

TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 83 8 
(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 9: 

nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 60 

TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 120 

TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 18 0 

CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 24 0 

CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 300 

GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 36 0 

TTGATTTTAC AATATCAATA AT CAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT 420 

CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 4 80 

TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 54 0 

GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATC CAGTA A CACTAATT C 600 

CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 660 

ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 72 0 

CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 780 

CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 840 

ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 900 
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AAACCATTGT TGTCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 
ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 10 80 

GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTt GTAC TTCATTAaCA H4 0 

GGtATTGCAA 1150 
(2) INFORMATION FOR SEQ ID NO: 43 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60 

AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120 

TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGG C ATTC GTCACATTAA 180 

ATGATGG CAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 24 0 

AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300 

AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360 

CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 42 0 

TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 48 0 

AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 54 0 

AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600 

ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660 

TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720 

ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780 

TAATAACnTT TTGGATn 797 
(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TTTCCAAGAG AAGTTGCGGA AGTAATTAAT AAAACGCATC ATAATAAATT GGTCATTTCG 6 0 

ATGATTTCGk. CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 120 

ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 180 

AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 240 

CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 3 00 

CTTAATAATT Gt TTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 360 

CATCCACATG ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 420 

CTCGATGAAG CGGTAGTTAC GTATTATTTG GAAAAATGGA CAAAAGAAGA TGATGCTATT 480 

TTAAGTGATT TAGCAAGTCG ATTTATTAAT CGAGACTTAT TTAAATATAT TCCATTTGAT 54 0 

GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 600 

GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 660 

TCAAATCGCA AACCGATTCA TTTATTAAGA CAAGATGGTA CGATTAGAGA AATAAGCAAT 720 

CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 78 0 

C CT AG AG AAA TGGTTGCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 84 0 

ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCGT TAAATATGGA GGTTATATCA 900 

TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 960 

AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 1020 

GCTAACAAAG AAGCATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 1080 

GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AG aATATTGG 114 0 

TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCG 120 0 

TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 126 0 

AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG TGACGAGAAT 13 20 

AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 13 80 

GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 14 4 0 

AAAGATATGA TTCATTTCTC AATGAC 14 66 
(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

TTGTACGAAT ACAAAGTTGC ATTAATATAG AATCATTTAT GCTTTTTAAT ATCACAAAAC 60 

AAATTGATTA AACTAGCTAA AATAGTCAAA ATTGGAATCA AGATTACATA AATATCATAA 120 

CCTCTAGATA TTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 180 

GCCAAACTAA TTTTATATTT CATTTCAATT CCCACCTTAA TAGCATTTAT TAAAATTCTA 24 0 

TAATGTTTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 300 

TACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATCAG 360 

TCCTAAAGAC AATGTATCCG CTGcATAATT CGaTGtAGAT GAGATACTAA ATACTTTCCC 4 20 

CATCAAATGA TTTGGCGTTT TTATTTGAAT GGCAACTGAT CTTGTTAGTC CCTCTATAGA 480 

TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 54 0 

AATAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 600 

CTTTTCTGAC AGCAAACCCA TAATCAATGA CATTAATAAA GATGCTATAC CTAAACATGA 660 

TGTAGCTAGT CCATATACAC CAACGCCCTC TTTTAATATA TTGGAAATAA ACAATGGTAA 72 0 

TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 780 

TGGAATTTCT TTAGATTGTT TCAAGAATTC CCAAGTTTCA GAAAAATCTT CTTTTGAGTG 84 0 

TCTATCAATC ATGTTGTTAT TTGTATATTT TAAAAGTGCA TTAAAAATAA ATC CTAAAAA 900 

TAG CAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 960 

AATTAAAGCA GGTAAAATAA TATTTGAGCC TCTTTGCAAA CTATCGATTA ACG CATT ACT 1020 

TGTTGCTAAA TGCTCCTCAT CAATAATTTC AGGAAGAATT GCCCTAAACG CAGGATCCGT 10 80 

ATAGCAGTTA ATAATGGTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AATTTAAATT 114 0 

TGATGTTATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 1200 

ATAAAGTATT TTCTTTTTAT TATGTTTATC AGAATATGCG CCACCGAAAA TACCAAATAT 126 0 

AATAAATGGA AGTGTTTGAC TCATAACCAT CATTGATAAT TTTAAAGATG ATTGGTTTGT 1320 

CAATTCAACA GTAAACCAAA TTATTTGTAA CGAAAACAGC ACAAAACAAC TCCGACGTAA 1380 

GArATTACCA ATCAATAAAT ATGTAAAGTT TCTATTTTTC AAAACTTCTA AATACAACAT 144 0 

ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCCACCA GCTTTTTTAG ACCTTCTTCT 15 00 

AAACTCTCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGTT AAATATCAAT 156 0 

TATTTCGAAA TATACATACT ACTTGAAACA CCATACATAA CCCCCAAAAT GACTACTCAG 16 2 0 

AGGTTATATT CTACTAATTA TGATTATATT AAATATGAAA ATATTATCAA AAAAATCAAA 16 80 
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TCATT CTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 18 00 

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA I860 

5 GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920 

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1980 

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 204 0 

'° GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100 

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 216 0 

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCAC CTGTT 2220 

IS 

AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280 

ACTCGCTCTT CACCAGGTTG TAAT 2304 
20 (2) INFORMATION FOR SEQ ID NO: 4 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

30 

GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 60 

GAGGCTGTGT CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 120 

3S GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 180 

AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 24 0 

GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 300 

40 ACTTTTTTTT CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 3 60 

GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 4 20 

CAATTATTGA ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 4 80 

45 

TTTGCTTCAA TTTGATGAAA AT CAATG AAA TTCAAAATTG AAGAAATCCT TTAATATCAA 540 

GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 600 

so ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 6 60 

TATACAAAAC ACCTCGATGT TATGTCGAGG TGTTTTTTTG CGTTTGTGTG GGGAATATGG 72 0 

AATAGAGTAT AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 780 

55 
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATT CAAGGG AAAGAGCTAT 90 0 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 96 0 

GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 102 0 

GTTGsGATGC TGGTGTTTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 10 8 0 

TATTTAATGA GTATCTTCTT AATTTGTTTA GGAACTGCTT CT AT CATTTA TAACTATAAA 114 0 

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 12 0 0 

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 12 6 0 

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 132 0 

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 138 0 

TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 144 0 

TATATAATGT ACTGAAGGCA ATTTTTATGT ATT ACAAAT C TAATCGTACA TGTAAAATTT 150 0 

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 156 0 

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620 

TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 16 80 

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 1740 

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 180 0 

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860 

GGATTATTAA CCGCAACATC T G AAGGGG AT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 192 0 

CGnAnATGAT TTATTA CGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980 

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 204 0 

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100 

CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160 

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220 

TGTTTTATTA GCTGGTATGA G AAGTT CAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 22 80 

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 234 0 

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 24 00 

ATTATTACGT AAAAGTGATT TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 24 60 

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AA C GATTGAA 2580 
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CAATTAAGAA CACATATTGA TGAAGAAACT GGTATTAAAG CGACGCATGA AGAAGGAATT 270 0 

CTACATCCAG TGGAAATTAT TGAATCTATG CAAAAGGTAT TAACTGATGA TACTACTGTA 276 0 

5 ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTACAATCCA 2820 

AGACATTTAT TATTTAGCAA TGGTATGCAA ACGCTTGGTG TAG CATT AC C GTGGGCAATT 288 0 

TCAGCTGCAC TTGTGCGCCC TAATACGCAA GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 2 94 0 

TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 300 0 

ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3 06 0 

;5 TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120 

AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180 

ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 324 0 

20 ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3300 

GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 3360 

TTAAAAGGAA CTGCATCAAT AAATGAATTA TTGCAACATG GTGACTTAGG TATCG CTACA 3420 

25 CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 34 80 

CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 354 0 

TTTGTAGCAG ATACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA 360 0 

30 

ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3660 

AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAATCGAT 3720 

3£ TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 3780 

TTTACACCAG AATTATTCCA TGGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 3 84 0 

GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3 900 

40 ATCCAAAATA TAGAAACATT TGAACAGCAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 3 960 

GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 4 020 

AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 4 0 80 

GTAAATATGA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 414 0 

TAAAATAAAT CCCTTCACTA TT AGCAG TAG TGAGGGGATT TATTAGGTTC CAGATATTTG 4 20 0 

so AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 4260 

GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4 320 

AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 4380 
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TTTCTTTAGA TTCACTACTT TTTTATTAC C ATCATTCAAA GTAAGCGTAT AAGTTGCTGT 4 500 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CATTTTTTAC 456 0 

TTTACTGTCA ATTTCTTGAT AAAGAACATT TTTATTTT C T GGGAAGATAA AGTAAGTTCG 4 62 0 

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 4 680 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 4 73 3 
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 

AATAAATGGT AATTTTATAT CATCAACGGC TACAAATGGG GCAGTTTCAT ATATAAATAA 60 

ACACATCTAC TCAACTGnTC TTAATTAATT AATAAATGAT TTACCAGATG A CAT ATAA CA 12 0 

GCCATTATTA GnTTAATTCT AATAGTTTAT TTAAATTTTC TTCGGTTGTC GCCCAACTGG 180 

TTGCGAATCT AACAACACGA TGTTGATCAT CGTATTTTTC CCAAACAGCA AATTTAACTT 24 0 

TTTGTTCTAA CTCTGCTATT TTCTCGTTAC TTAAAATAAA AAATTGTTGA TTGGTTGGAG 300 

AATCAAAGTA AAGACGATAG CCTTTATTTT TAAACCCGTC TTTCATCTTA TTTGCCATTT 360 

CGATAGCATG TCTGCTTATA TTAAAATATA AATTGTCCGT AAATAATTCT AAAAATTGTA 4 20 

TGCCTGTTAA CCGTCCTTTT GCTAAAAGGG CACCGTGGAT GCTTGATTCG AGTGGTAAAT 4 80 

TGTTTCGGTT CATTATTTTT CGTAAAAACA ATGGCTTCCC CGCATAATGC ACCTATCTkC 54 0 

GTACCACCTA TATAAAATAC ATCACAATAT TTAGCGrTGt CTTTAATAGT CATATCTGAT 60 0 

TGGTCACTCA TCAATCCATA CCCTAATCGT GCACCATCCA TAAATAATGG AAGCTGATAT 660 

TGCTTACATA CTTTGCATAA CTCTTCCAAT TCTGaTTTAG AGTATAATGT GCCATATTCT 720 

GTAGGATGAG AAATATATAC CATTCCTGGG AATACCATAT GGTCCTTTTT AAAATCACTT 7 80 

TTAAATGTCT CCATGTAAGT TTCAACATCT GAAGCACTAA CTTTTCCTTC CTTAGAGGGT 84 0 

ATAGTAATTA CTTTATGTCC ACT ATATT CA ATTGCACCGC CCTCATGCAC AGCAACATGA 900 

CCAGTGTCTG CTGAAATGAC CCCTTCGTAA CTTTCTAACA TTGAATTAAT AACAACCTGA 960 

TTGGTTTGTG TTCCACCTaC TAAAAAACGA ATTGTAGCAT TTGGaCAGTC AATTGTATCT 1020 

TTAATCTTTT CAATTGCCTG AGCTGTGAAT TGaTCAAAGC CATATCCCGA AGCTTGTACA 10 80 
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 12 0 0 

AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 12 6 0 

TATCGCGAAG TTGnATAAAT ATAT 12 84 



(2) INFORMATION FOR SEQ ID NO: 4 35: 

(i) SEQUENCE CHARACTERISTICS: 
w (A) LENGTH: 1072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60 

20 AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 120 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACAC CTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 24 0 

25 GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 3 00 

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 3 GO 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420 

30 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 4 80 

AATTC CTAAA GAG CGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 54 0 

3S ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC GOO 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC GS0 

CGATATTATG TCTGG TATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 72 0 

40 TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780 

TGAAG CAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 84 0 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 90 0 

45 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 960 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 1020 

50 TTTTAGCGCA AT CAC CAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 107 2 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 

55 
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(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 60 

w CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 12 0 

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 24 0 

' S TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGTTTTTTAA 300 

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 3 60 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 4 20 

20 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 4 80 

CTTCAGCAAG CTTTAACTCA TAGTGGC CAA CAGTTTGCGT GaAATGACTG TATGG CATG C 54 0 

25 TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTT CATT ATCATTACTC 6 00 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 660 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

30 AAAACATAAA CG CTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 780 

TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 84 0 

TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 9 00 

35 TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 1020 

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 10 80 

40 

CATTGG CT AT CGCTTTAATT GACT CAATTT TTCGTTTTAA TTCTGGATGA AACGCAC CGT 114 0 

TTTGCCCTGT TGTCG CATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 12 0 0 

4S TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 1260 

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 13 20 

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 13 80 

50 TAA TATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 144 0 

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 

TGATAACTGG ACCATCAGCA ACCGGAT CAG AGAAAGGTAC TCCAATTTCA ATTATATCTG 156 0 

55 
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GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATTGTTTCC 1680 

ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTT CGACATCGTA 174 0 

GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTG CAGG TATAATACCT 1800 

TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG TATCACTAGC ATTTTCAAAA 186 0 

GTTACTCTAC CAATGTCGTG GTAATAAGAA TGTTCTGGTC CAATACCAGG ATAATCAAGT 1920 

CCTGCTGAAA TAGAATGTGC TAGTTGCACT TGCCCATCTT CATCTTGAAT TAAATACATT 1980 

TTAGTACCAT GTAATACGCC AGGTGATCCT TTGCCAATTG CAAGTGCATG TTTATCAGTA 204 0 

TCATCGCCTT GACCTGCGGC TTCAACACCG TATAATG CAA CATCATCTTT AATAAATGGA 210 0 

T AAAATG T AC CGATTGCATT TGAGCCACCA CCGATACATG CTACAATTGC ATCCGGAAGT 2160 

CGACCTTCTT TCTTCAATAT CTGTGATTTT ATTTCTTTAC CAATCACACT CTGAAAATCT 2220 

CTAACAATCG TTGGGAACGG GT CTGGACCT AATGCAGAAC CTAATAAATA ATGTGTATCA 2280 

TCTACATGAC TTACC CAATA TTGCAATGCT TTATTAACTG CATCCGATAA AGTCCCTTGA 234 0 

CCATCTTCAA CTGCCACAAC CTTTGCACCA AGTAATTCCA TTCTAAATAC ATTAAGTTGT 24 00 

TGTCTTTTAA TATCTTCACT TCCCATAAAG ACAACAAGTT CCATATCAAA TAATGCAGCA 2460 

AC CGTAGCAC TAGCTACACC ATGTTGACCC GCACCAGTTT CAGCAACAAG CTTCTTCTTG 2520 

CCCATTCTTT TAGCAAGCAA CGCTTGACCT AACGCATTAT TAATTTTATG GGCGCCTGTA 2580 

TGATTTAGAT CCTCTCGTTT CAAATATATT TTAGCGCCAC CTAGGCTTTC AGTATATGAT 264 0 

GCAGCATATG TAAGTGGTGT CGCGCGTCCT AC ATACT CTG ATAAATAGTA TTCCAGTTCT 2700 

CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT TTTTCAACTC AATAATTGCT 2760 

GGCATTAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC CAAAGAAACC TAATTCATCT 2820 

GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT TTCACAATAT TTACAATTGC 2880 

TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA CCTGATGCAA GATCATAACC 2 940 

TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA TGCCTCCTGC 3 000 

TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3 060 

ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120 

ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180 

ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 324 0 

viTG a CGwATg C t TAAwACgT GTGC CAATGG T 3271 
(2) INFORMATION FOR SEQ ID NO: 437: 
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(A) LENGTH: 1553 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
5 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 

'0 ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60 

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

T AATTG AG CT ATG CTTATT A TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180 

' 5 ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAG CCCAA AATACTTGTT 24 0 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360 

20 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 480 

25 ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTT T TCTT TAGGTGTTAA AGGTTTAGTA 54 0 

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 60 0 

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660 

30 TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720 

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780 

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 84 0 

35 ACAAAATGGT TAGACACAAA AGCATTTAGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960 

ACACGTACAT TTGACCATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAGTA 102 0 

40 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 1140 

4S GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 12 00 

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260 

AAAGAGCAAT TCACATACTT CCCTAACTTT Tt CCATCAAA CAGGTCAAGG TAAAACATCT 1320 

so GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380 

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 144 0 

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 150 0 

55 
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(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : double 

(D) TOPOLOGY: linear 



to 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 





ACAAAATCAA 


TCAGCAAAAT 


GATTGGATTA 


AGACGCAAAT 


TGAGCGTTCA 


ATGGAAGGCG 




15 


AAACAGTTGG 


CATTAATGAT 


CAAAATATAG 


AAATATATAG 


TGAAcGtTgA GATTTATACC 


120 




ATACACTCGT 


ACCTTTAAAT 


CAAGAATTGC 


ATAAGTTGCG 


ACTTAAAACT 


CAAAACTTAA 






CCAATGAAAA 


TTATAATATT 


AATGATGTGA 


AAGTTAAAAA 


GATTATTGAA 


GATGAACGTC 


24 0 




AAAGACTAGC 


ACGAGAACTT 


CACGATTCTG 


TTAGTCAGCA ACTTTTTGCG 


GCmAGTATGA 






TGCTATCTGC 


TATCAAAGAA 


ACGAAtTAGA 


ACCACCATTA 


GACCAACAAA 


TTCCTATTTT 


360 




AGAGAAAATG 




CGCAGTTAGA AATGCGTGCT 


TTGCTGTTAC 


ATTTAAGACC 




25 




AAAGACAAAT 


CTTTAGGTGA GGGTATTAAA GATTTAGTTA TTGATTTACA 






AAAAAAAGTG 


CCAATGAAAG 


TTGTGCATGA 


AATACAAGAT 


TTTAAAGTGC 


CT AAAGG TAT 






TGAAGATCAT 


TTGTTCAGAA 


TTACACAGGA 


AGCAATTTCG 


AATACATTGC 


GTCATTCAAA 




30 


CGGTACAAAA 


GTGACAGTAG 


AATTGTTTAA 


TAAAGACGAT 


TATTTATTGT 


TGAGAATTCA 






AGATAATGGT 


AAAGGTTTTA 


ATGTTGATGA 


AAAATTAGAA 


CAAAGTTATG 


GACTTAAAAA 


720 


35 


TATGCGTGAA 


AGAGCTTTGG 


AAATTGGTGC 


AACGTTCCAT 


ATTGTATCAT 


TGCCAGATTC 


780 




AGGTACACGT 


ATCGAGGTGA 


AAGCACCTTT 


AAATAAGGAG 


GATTCGTATG 


ACGATTAAAG 


840 




TATTGTTTGT 


GGATGATCAT 


GAAATGGTAC 


GTATAGGAAT 


TTCAAGTTAT 


CTATCAACGC 


900 


40 


AAAGTGATAT 


TGAAGTAGTT 


GGTGAAGGCG CTTCTGGTAA AGAAGCAATT GCCAAAGCCC 


960 




ATGAGTTGAA 


GCCAGATTTA 


ATTTTAATGG 


ATTTACTTAT 


GGATGACATG 


GATGGTGTAG 


1020 




AAGCGACGAC 


TCAGATTAAA 


AAAGATTTAC 


CGCAAATTAA 


AGTATTAATG 


TTAACTAGTT 


1080 


45 


TTATTGAAGA 


TAAAGAGGTA 


TATCGTGCAT 


TAGATGCAGG 


TGTCGATAGT 


TACATTTTAA 


1140 




AAACAACAAG 


TGCAAAAGAT 


ATCGCCGATG 


CAGTT CGTAA 


AacTTCTAGA 


GGAGAATCTG 


1200 


50 


TTTTTGAACC 


GGAAGTTTTA 


GTGAAAATGC 


GTAACCGTAT 


GAAAAAGCGC 


GCAGAGTTAT 


1260 


ATGAAATGCT 


TACAGAACGA 


GAAATGGAAA 


TATTATTATT 


GATTGCGAAA 


GGTTACTCAA 


1320 




ATCAAGAAAT 


TGCTAGTGCA 


T CGCATATTA 


CTATTAAAAC 


GGTTAAGACA 


CATGTGAGTA 


1380 



£5 
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<2) INFORMATION FOR SEQ ID NO: 439: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 08 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 

ATGTnGG CGG ACGTAAAGGA CATGTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 60 

TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA A CAATTATTT GCAGCAGGTT 12 0 

ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 18 0 

CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 24 0 

AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 30 0 

AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 36 0 

ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 42 0 

ATTAATAAAA TTCACATAAA ATTCAAATTt SCCrAcCAAA AATTTTTGGT TGGyTATTTT 480 

TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 54 0 

CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 60 0 

GTCCAGTG 60 8 
(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440: 

CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 6 0 

AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 120 

TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCG CCAA 18 0 

AAACCACTTA TGTATCTAAA TAAAAAT CGC GGTGGTGATG GTGGCATACC AGGTTATATT 24 0 

ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 30 0 

GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 360 
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ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA A CGTAAAAAT TAAAAATACT 4 80 

ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 54 0 

GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600 

TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 660 

ACAACACTTC CAGCACAAAA AA 6 82 



(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2574 base pairs 
)5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60 

ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 120 

25 AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180 

TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 24 0 

CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 3 00 

30 

CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360 

GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCAT CTTT 4 20 

35 GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 4 80 

AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 54 0 

ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 600 

40 TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660 

ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720 

TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 7 80 

45 AACGCCTTTA ACAATC CAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 84 0 

TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 90 0 

TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 9S0 

50 

GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCAT AA CT 10 2 0 

TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 10 8 0 

55 



1344 



EP 0 786 519 A2 





AAATTCTTTA 


TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 


1200 




GTTCATAACG 


TGTTCACCTA TATCTCGTGA AGACACTTCC GTATGACCTT CATCTCGTAA 


1260 


5 


TTGCCATTCA 


AC CTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 


1320 




ACAAGAACGC 


ACAAGTCCAT TAAGTAT CTT TTCTCTTGAA AACTGCTCTC TTGTGCCATC 


1380 


10 


TTTTTTCACA 


ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 


1440 




ACAATTTTCA 


CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 


1500 




TACAACTTTA 


GATTGTGTAG AATTACATTT CGGGCATTTC ATTACATCAC CCTCTTTATT 


1560 




TTGATTATGC 


CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 


1620 




CATTTAACCA 


AATAATGAAA CAATAAAAAA CATTATATCG TTACTTATTA AGT AATTTG C 






ATGACAATAT 


TATTGTATTA AAAATAAAAA ACCTAACTCC GAAGTCAGAG TTAGG CTATA 


1740 


20 


AATTAATTGT 


ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTGC 


1800 




TCAGCAACAT 


CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 


1860 




TTTACTTTAT 


TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 


1920 


25 


TTAGTATTAA 


AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA 


1980 




CCTGCATTTT 


CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAACA 


2040 




ACTAAATCAA 


CGAGCGATAC ATTCTTTGTT GGTACACGTA ATGCCATGCC GTGTAATTTA 


2100 


30 


CCTTCTAATT 


CTGGTAATAC TTCTTTTAAA GCTTTCGCCG CACCAGTAGA AGTAGGAATA 


2160 




ATGCTTTCAT 


TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 


2220 


35 


TGGTCATTTG 


TAATAGCGTG AACAGTAGTC ATTAAACCAT TAACTATTCC AAACTGATTA 


2280 




TTTAAAACTT 


TTGCAACTGG ACCAATG CAA TTAGTAGTAC ATGAAG CATT ACTAAAAATG 


2340 




TCAAATGCTT 


CTATATCTAA TTGGTTATCA TTTACGCCTT TAACTACCAT TTGAACATGT 






CCACCTTTTG 


nAGGACCAGT TAACAAAAsT TTTtTGGCAC CTGCTTTAAT ATGTGCGATG 


2460 




GCTTTATCAC 


CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 


2520 




TTCCATGGCA 


AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 


2574 


45 


(2) INFORMATION FOR SEQ ID NO: 442: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



55 
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CAAGGTACCG GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 6 0 

ATAGATGTTA AAAGCGAATT AGGGAAAGGC ACAACATTTA TTATTAAACT ACCAAAACCA 12 0 

GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TCTAATTTAT 180 

AGCATGCGTT TTTTGATTGA TGTGGGGAAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 24 0 

GCGAATATCA ACTATTAAAT ATATTACTAA TTTATACAAA AATATAAAGT TTGATAAAGT 300 

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTAAT TAAATTATTC 3 60 

GAGGTGT CAT ATGAAAAAAT TCATTGGATC AGTTTTAGCT ACGACATTAA TTTTAGGGGG 420 

ATGTTCCACG ATGGAAAATG AATCAAAAAA AGACACAAAA ACAGAAACAA AATCTGTACC 4 80 

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA 54 0 

TGCGATTGAA TTTGCGAAGA AGCATC GTAA AGAATTTGAA AAAGTAGGTG AACAATTCTT 600 

TAAAGATAAC TTTGGACTAA AAGTTAAAGC TACAAATGTT GTAGGTAAAG ATGATGGTGT 660 

AGAAGTTTAT GTGCATTGTG AAGATCATGG CATTGTATTT AATGCAAGTC TACCTTTGTA 720 

CAAAGATGCC ATCCATCAAA AAGGATCAAT GCG CAGTAAT GACAATGGTg ATGATATGAG 780 

TATGATGGTG GGTACAGTGC TGAGTGGCTT TGAATATCGA GCGCAAAAAG AAAAGTATGA 84 0 

TAACTTATAT AAATTCTTCA AAGAAAATGA AAAGAAATAT CAATATACAG GCTTTACAAA 90 0 

AGAGGCAATT AACAAGACAC AAAATGTCGG ATATAAAAAT GAATATTTTT ATATTACATA 96 0 

CTCTTCTAGA AGTTTAAAAG AATATCGAAA GTATTATGAA CCACTGATTC GAAAAAATGA 1020 

TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 108 0 

AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 114 0 

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 12 0 0 

AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 12 6 0 

TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 13 2 0 

CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 13 8 0 

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 144 0 

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 150 0 

AGACAATAAT TTATCAAGGA ACATCTAATG AGG CAATTAA TCCAAATAAT CCATTAAAAT 156 0 

CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 162 0 

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA AAGTTAGAAG 1680 

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 174 0 
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ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 186 0 

ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 192 0 

CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980 

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 2 04 0 

AAAGGAAGTT AAATAGAAAA GATAACACAT ACGATACTGT ACCACTAATC AAAATAAAGT 2100 

CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 216 0 

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 222 0 

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 22 8 0 

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 234 0 

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 24 00 

AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 24 60 

ACT CACTTG A AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 2520 

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2 5 80 

AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 264 0 

AT ATAGAT CA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 27 00 

TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 27 60 

ATATTAGAAG TATTATGAAA CAA CAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 282 0 

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 28 8 0 

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 2 94 0 

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3 000 

CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3 06 0 

AAGAATTTTT GTCCCTGATT CCTAATGTAA CTT GTAATT A TATTAAAACT AAATTAGAAG 3120 

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 318 0 

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 3 24 0 

TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 33 0 0 

TGTTCTCAAG CGTTTTATAA AGGTTG 332 6 
(2) INFORMATION FOR SEQ ID NO: 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 01 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



1347 



EP 0 786 519 A2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

S ACCTATAGCT GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC 60 

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 12 0 

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 18 0 

10 ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 24 0 

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 300 

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 36 0 

75 

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA 42 0 

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAAC TTACAA CAAACTTAAG 48 0 

20 TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 54 0 

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 6 00 

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 66 0 

25 CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG cCCGGACTAC ATTCATTAGA 720 

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780 

GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 84 0 

30 AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900 

TGAAGTACAA TGTGGCTGGG CAG CAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 96 0 

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAATCTT 1020 

35 

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 108 0 

CTGCTTTTTA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 1140 

40 CATTTGT CCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 1200 

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 126 0 

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 1320 

45 TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380 

ATTCCTGTAA CAAGAACCGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 144 0 

AATTGT CGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500 

50 

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 156 0 

AGGCGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AATTTTTTAA CAACAT CGT C 1620 

55 
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AATTTTTTTA TTTAAAACAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTTAAACG 174 0 

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 18 00 

ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 1860 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 192 0 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCG TTTCAGCTTT 1980 

GTTTGCAAGT ACAACCATTT GAGAAC CTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 2 04 0 

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 210 0 

TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC 2160 

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 222 0 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 2280 

AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 234 0 

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGT CCT ACAACTGCCA CTTTATTTTC 24 0 0 

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 24 6 0 

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2 520 

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2 580 

TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 264 0 

ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATAT CTT GGAAATAAAT 27 00 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AG CCGCCAAT 276 0 

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2 820 

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 28 80 

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 2 94 0 

TTAATATCA C AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3 000 

GTCCCTATTA AATTATCTTA GAATCAT CAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3 06 0 

ACATACTTAT ATATGGTTAC GATGTCC CAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 3180 

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 324 0 

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTG CAGT CGCACCGCCA 3 3 00 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 33 60 

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTG TGTGA 34 20 
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TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CAC CATTTTG AATCGCATCT 354 0 

TCTGTTT CAA ATGCTTTCGT CGCAGTTGTC GATGCAC CTA ATGGGAATCC TATTACCGTA 3600 

S CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTGGATTC 3 66 0 

ACACATACAG ATTTAAAATG GTATGCTTTC GCTTCATCGA TGATTTGATC GATTTGCGTA 372 0 

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 378 0 

'° CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 384 0 

TCATTTAAAT ATCAGGTTAG ATGTT CGCTT TTTATGTAAC CGCATACATA TACTATTACA 3 90 0 

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACGAT 3 96 0 

15 

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4 02 0 

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4080 

20 ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 414 0 

AGTATTAATG CCTGGCGATC CGCTACGTGC AAAATATATT GCTGATAATT TTTTAGAAAA 42 0 0 

TGTTGAACAA TTTAACGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 426 0 

25 AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA 432 0 

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 43 8 0 

AGAAAATGTT AACTTATACG ATGTTATTAT TGCACAAGCT GCAT CAACTA ATTCAAATTA 444 0 

30 

TGTAGATCAA TACAATATTC CAGGT CATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 450 0 

TAAAGCTAAA AATGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA ACGTATTATC 4 56 0 

TTCTGATACA TTTTACAATG CCGATCCAAC ATTCAATGAT GCTTGGAAAA AAATGGGTAT 462 0 

35 

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATG CTGGTAAAAA 46 80 

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACGT GACGAAGCTA CTACACCTGA 474 0 

40 AGAACGTCAA AATTCATTTA CACAAATGAT GGAAATCGCT TTAGAAATCG CAGAGTAACT 48 0 0 

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT 486 0 

TCTGGGCGTA TCCCCATTTA TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4 920 

45 AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTATAA 4 980 

ACATATCTTC TGATACATCA TCGCCAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 504 0 

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100 

SO 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 516 0 

TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220 

55 
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AATTGTG TAG CTTTGTGTAA G 
(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 444: 

GAAGATGAAA GCTATATTGA AGATGATGTT ACTAAAAAGG AAGCTATTTT AAGTATGCAA 60 

ATTCCTAAAG GTTTCTCTCA AAAATTAAAA GAGAACCGTT TAAAAGAAAC GATACAGTTA 120 

TATGGTAGAG ATGACTTTAT AGGTGGTATT GCTGTAGAAA TTGTTAGTAG TTCATTATAT 180 

GAGCAGCAAA TTCCTAACAT TATTTATGAA CACCTTGAGG ATATGAAACA GCATCAATCC 240 

ATCGATGCTA TCAACAAGTC CTATCATAAA CATACACCTG AATCTAAAAT CAAATTTGTG 3 00 

TCGCTTACTA AACAAGCACA ACACTCTATT TCAATTAGCT TAATCTTTGC GGTGATTTTA 36 0 

TTTGTTAGCG CTGTTCAAGT AGTACTTCAT TATCGTTTAA ACCAACAAGC AG CATTGCAA 4 20 

CGATTATCAC AATATCATTT AAGCCGTTTC AAACTATATA GTACTTATGT AATGACACAT 480 

ACGATTTTGT TATTGTTGGT ACTATTGGCA GTTAGTCTAT ATTTGTCTCA ACCACTCAGC 54 0 

TTAATATTTT ACTTAAAATC ACTGTTACTT ATATTGATTT ATGAGATAGG TATCGTCTTT 600 

ATCTTATTCC ATATTCAAAC AATAAGT CAT CGATT ATT CA TGACATTTAT ATATGCACTT 66 0 

GCTATGGGTA TCGTATACTT GATTATTTTC ATGTAAAGGA GCGTAACTGA TGATAGAAAT 72 0 

TAATAACCTT TCAAAGCGTT ACCGTAACAA ACAGATTTTC AATCATTTAA CTATGTCCTT 78 0 

TGATAGTAAT CGTTTAACCG TATTACTTGG TGATAATGGT GCTGGAAAAT CAACATTACT 84 0 

TCGTATGATT GCTGGTATTG AAAAAGCTAA TGATGGAACT ATCAACTATT TCGGCGAAAA 90 0 

ATGGAATCAA AGACaAATAC aAAATCACAT CGGTTATGTG CCACAAGACA TTGCGTTATT 96 0 

TGAACA CATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 1020 

TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 1080 

AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 114 0 

TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 1200 

TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 126 0 

TCATTTAGAT GAAGTTGAAG CACTTGCAGA TGATATCAAG TTAATTGGCC AAGATCCTTT 132 0 
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CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TG CAAGT ATG 144 0 

ACGTTGGGTC TACTG CAT AT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 150 0 

GAGGCG CTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 156 0 

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 16 20 

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 16 80 

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 174 0 

AATGCGCACC ACCACCGTGA TATTGTCCAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800 

AGCTTGCGTC TTTCG CATG A CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 18 60 

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 192 0 

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 1980 

GATTAATATG CTGTTGGCTT TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 2 04 0 

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100 

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT AT ATT CTTT A TTATCAAACA 2160 

TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 2 220 

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 2280 

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 234 0 

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 24 00 

TAGTTTTCAA TTAAACTGTA C TG CACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 24 6 0 

TAATTCTCAT TTTC CAACT A TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2 52 0 

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2 580 

CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACG CT A 264 0 

ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 2700 

GATTGTCATT CAACTTTCTA ATAGTGATAT GCTT CTCAGG CTTAAAAATC GTCATATCTT 276 0 

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2 820 

TAATATATAA CGTTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 2880 

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCAT CTAAT AAATAAAATG 294 0 

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3000 

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3 060 

TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 312 0 
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CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3 24 0 

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCGGAAC TTCACCGATG TCATTTAAAT 3 3 00 

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3 360 

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 34 2 0 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 34 8 0 

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAG CCATATT 3 54 0 

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3 6 00 

GCGAACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 3 6 60 

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3 720 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3 7 80 

AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3 84 0 

GTGTGAGT GA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3 9 00 

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 396 0 

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTTTG CACAATCATT TATTAATATA 4 02 0 

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4 080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 414 0 

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 42 00 

ATTATATCGG ACATCACTGC ACTG CTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 42 60 

GCTAT CTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 432 0 

AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4 3 80 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 444 0 

ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500 

CACCGTCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4 560 

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4 6 20 

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680 

CCTTTTATTG ATTTCTCAGT ATTACG CAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 474 0 

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4 800 

CTTGGTTTAC CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4 860 

GCGATGATTC GTTTAGG TG A AAAAGCAACA TTACGTTTCG GTGGCAAATT GATG CTCATC 4 920 
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TCACAATATG 


TCATTGCAGT 


TATCATTGGT 


TTCGTCATAT 


GTGCGATAGG 


TAATGGTTTA 


5040 




GTCGCAACAC 


CTGGACTTAC 


GATTGCAATT 


TTCAGTATGC 


CTAATGAAAA 


AGTTGGTTTA 


5100 


5 


GCTACAGGAT 


TATATAAAAT 


GAGTGGTACA TTAGGTGGCT 


C CTTTGGT AT 


AGCACTAAGT 


5160 




ACTACAGTTT 


TCAGTATGTT 


ACAACTAAAC 


TATGCACCAA GTGTAGCTGC AACCGTAACA 


5220 


10 


TTTATAGTCA 


GCATTGTATT 


GATGATCCTT 


GGCTCATTGT 


CTGCATACAT 


GATCATTCCA 


5280 


AAAACAGTTA 


AATCTTAAAT 


ATAATAGAAG 


AATTATGTTT 


CGAAATATCT 


TTATCACTTT 


5340 




AAAATGATAT 


ACAAGAAATC 


CAAGAAAAAT 


AAGCGAACTG 


AATAAATAAA 


GATT CAATTA 


5400 




ACGCAT CAGT 


ATTAGGATTC 


ACTCTAAAAC 


GATTAATAGT 


TTTATAAGAA 


GGTGTTTGAT 


5460 




CTTGAGCTAA 


CCACATCATT 


CGAATACTGT 


CATGAAGTAA 


TTTCTCTATT 


CTACGACCAG 


5520 




AAAATACAGA 


TTGAGTATAT 


GCATATAAGA 


TGATTTTTAA 


CATCATCTTT 


GGATGATAGG 


5580 


20 


ATGTTGCGcC 


ACGATGATGT 


CTGAATTCAT 


CGAATTTGCT 


ATCAGGTATC 


GTTTCAACAA 


5640 




TTTCATTAAC 


ATGTCGCGAA 


AT AT CATTTT 


GAGGAATTCT 


AACAGAAGTT 


TTTATTGGTA 


5700 




GTGTAAGTTG 


GGCAAAGTGT 


CTTATTTTTT 


TAAAGTATTT 


CAAAGTAAAA 


TTACATGTTA 


5760 


25 


ATACGTAGTA 


TTAATGGCGA 


GACTCCTGAG 


GGAGCAGTGC 


CAGTCGAAGA 


CCGAGGCTGA 


5820 




GACGGCACCC 


TAGGAAAGCG 


AAGCCATTCA 


ATACGAAGTA 


TTGTATAAAT 


AGAGAACAGC 


5880 




AGTAAGATAT 


TTTCTAATTG 


AAAATTATCT 


TACTGCTGTT 


TTTTAGGGAT 


TTATGTCCCA 


5940 


30 


GCCTCTTACT 


CTAATTATAT 


TCACTATCAA 


TTAGACAAAA 


TGGCCATTTT 


CAAATATCAC 


6000 




GCGTTGTTTC 


TGACCTTGAA 


TATATTTATT 


ATAATTCTCT 


TTTTGAAAAT 


CAGTTAACAT 


6060 




TAATTTAGAT 


GTACCGTATT 


TTAACACTTT 


TTGCATTGTT 


TCTATTCTCA 


TTTTTCTAAA 


6120 




TAACCATCCA 


TCTTTTAACA 


CAATACGATT 


AACAGCATCA 


TATGATAATT 


CTACTGTTTC 


6180 




TTTAATTTCA 


AATGTCTTGA 


ATGAAATAAT 


CGTGCACATT 


AAAAACGTAT 


CACCAAAGTA 


6240 


40 


AT AAA CAT CT AAATCATCAC 


GTTTATGTTG 


TCCAACTAAC 


AAACGACCAT 


ATTCGAACTC 


6300 




TTTTTCTGGA 


TATTTCAATT 


CTAAAAAACT 


AATAATCTCT 


TCTTCTTTTA ATTTGAATTG 


6360 




CATTTAAAAA 


CATCCTCTCT 


TAAGTTTTAA 


CAAG CCTTAA 


TTAAAAATTT 


TTTCAATCAC 


6420 


45 


ATAGTTCAAT 


ATACATCATT 


TCGTTATGTT 


TTTTAATACT 


TTGTT CAAAA 


ACAAATATTT 


6480 




TATTCTTTAA 


AATAATGACT 


TTTGTATTTT 


TAATCACAAT 


AAACATTTTA 


AAATTCTTGT 


6540 




TATCATAATC 


ATTAAAAGGT 


ATTAACCTTA 


ATAATATATT 


CTCTCGTCTC 


AACCTTAATC 


6600 




GTATACTTCA 


GACGTCTGTT 


TGTAGACAAT 


AAAAGTCATT 


CACGTCTTCA 


TATGTCATCA 


6660 




AATGTTTATC 


ATGATATGAT 


GAATATAATA 


ATCGGGTATA 


TAACTGTATG 


ATTAATTACA 


6720 



55 
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TGGTGTATCA GTTA CAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC 684 0 

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6 90 0 

5 TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATT GG C GTCATTTTGC CTAGCTTAAC 6 9 60 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 7020 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080 

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 7140 

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATG AC C AT CAAGG CTA 720 0 

^ TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATAT GTC 7320 

GACTCGTGTC GCTGGATTTG TCGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 73 80 

20 CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 74 40 

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 756 0 

25 TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATTGATAT 7680 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 774 0 

30 

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 7800 

AGGGTTTTTA ATCATAATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 7860 

^ ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7 920 

ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 7980 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 804 0 

40 TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100 

CTTC GCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATT CGGACAA ATCATTACAT TTAAAGCGTT 8220 

45 CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 82 80 

ATCTTTATGG GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 834 0 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 84 00 

50 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 84 60 

TGGATACTGG TTATATTCAG CTGCACCGCA AG CAACTT CT ATTGATGGCC TAACTGCCTT 8520 

55 
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AGCAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 864 0 

TGCATTTGGT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 870 0 

TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 876 0 

AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGGCTTAG TACTCATTTT 8 82 0 

AGTAGCCGCT TCTGTTACAG TATTTATAAA ATAAGGAGTG TAGATGTCAT GAAAAAATCA 888 0 

GCTGTTTTAA ATGAACATAT TTCAAAAGCA ATCGCGACAA TTGGTCATTT TGATTTATTA 894 0 

ACGATTAATG ACGCTGGCAT GCCAATTCCA AATGAT CAT C GTCGTATCGA CCTAGCTGTA 9000 

ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 906 0 

CAAAAAATAT ACTTAG CAG A AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 912 0 

AAACAATTGA TTTCATCGGA AATCGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 91 BO 

AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 924 0 

AATATTGCAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 93 0 0 

ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 936 0 

AG ATATG CAC AACCAGG CGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 94 2 0 

GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGCAGACAC AACATTTATT 94 80 

ACTAAAATTG GCACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 9 54 0 

ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 96 00 

GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 966 0 

CCTGAAGATG TTATTAACGC AAAAGACGCT ATAAT CAATG CAGACTTTGT CGtTGCACAA 972 0 

TTAGAAGTAC CCATCCCGGC T ATT AT AT CT GCATTTGAAA TTGC CAAGG C ACATGGTGTG 9780 

ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 9 84 0 

GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 9900 

GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 9 960 

ATTACGCTAG GTAAG CAAGG TACATATTTT GCTACTAAAA ATCAAAGCCA ACACAT CGAA 10 020 

GCTTATAAAG TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 10 0 80 

GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 1014 0 

GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 10200 

GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 10260 

CAACATTCGT ATATCATGGC TGTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 10320 
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TTTGTAATCT 


TTTTAACTTC 


CAAATTaTCG 


CATATAAATA 


TGCTATATTA 


ATGATAATAA 




TTATCAATTA AAAGGAGGTT 


ATGCTATGTC 


TAAAGAAGCT 


GGTCATACAT 


TTTTAGCTAA 


10500 


ATTAGGAAAA 


ACTCGTCTAC 


GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 


10560 


AGGGGCATTT 


TCACAAGATA 


AACAAGTGTT 


AGAAGTGGCA 


TGTAATATGT 


GCACAACATC 


10620 


TATTTATCTA 


GCTCATACAT 


ATGGCTGTCA 


CATTCAAGGC 


GTTGATATAA 


ATAAGAAAGC 


10680 



ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 1074 0 

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAAT CAA TTCGATATCG TTTTAAATGA 10800 

1S AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860 

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10920 

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCGCAGAC 1098 0 

20 GAAACTTGGC TGGTTAGATT TATATAATCA AG CTGGTTTT AATCATGTGC ATTATCATAC 11040 

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100 

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 11160 

25 TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA 11220 

GCACTAAATA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 1128 0 

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340 



TTTGGTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGTT AGTGGcATTA 11400 
TTTTTTyCTn AAAATTTTAA CmTCmCGGGr TCGGGAmCrG AAaTGrtAwT TcGCrmAAtT 11460 



w 



30 



35 



TAwTcT 



11466 



(2) INFORMATION FOR SEQ ID NO: 445: 



40 



(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 44 5: 



TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 



60 



ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 



120 



50 



AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 



180 



CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 



240 



55 
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AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 360 

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACT TAAAG AC AGCAAGAnCC 420 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 4 80 

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 54 0 

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 600 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 660 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 72 0 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGCG TGTGAAATCA GCAAATGATG 780 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACGTGAAG 84 0 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

CTCAAAAAGA TGCTGAAAAG AAAGTGGCGC CAAAAGTTGA GGCTCCTCAA ATTCAATCAC 96 0 

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 1020 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAGATTCAT 1080 

TTAACTATGG TTACAAGTAT TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 114 0 

CAGCAAAGTA CTACTATAAT ACGTACTATA AATACCAAGG TGCGATTGAT CAAACAGTAT 1200 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 126 0 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 1320 

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 13 80 

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 14 4 0 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATCATTTTT TAAATAATTT TCGTAACAAA 1500 

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCATTTTCCA 1560 

TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 1620 

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 1680 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 174 0 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 1800 

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 18 6 0 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 1920 

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA GAAAGAAAGT GATTTCTATG 1980 

ATTAAAAAtA AAATATTAAC AG CAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 204 0 
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CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 2160 
GGATTTGGTA AAGGnT 2176 
(2) INFORMATION FOR SEQ ID NO: 44 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

IS 

AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTGTTATGC 60 

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 120 

20 CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180 

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 24 0 

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 3 00 

25 ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTGTC 360 

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420 

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 4 80 

30 TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTTTGAA GCTTTACAAA AATTAATCGA 540 

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 600 

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCGTTTA CAGATGACGG 660 

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720 

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 780 

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 840 

40 

AATCGCAAGA GATGTACTAT TaGCTGAAGC AG CAGGTTGT CATTATCATG TATGTCATGT 900 

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGCGCAGGCA TTCATGTTAC 960 

4S AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 10 20 

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 10 80 

GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGAcCAT GCACCACATG CACGTGATGA 1140 

so AAAAGCACAA CCAATGGAAA AAGCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200 

ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 12 6 0 
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TTATG CAGAT TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 13 80 

CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTTTATGGAA ATCCGATCTT 144 0 

5 AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 1500 

TCTAGTGTTA GAAGACGGTC TTTTACGAGG CTACCGTTAG GTCTGATAAC TTACTGA 1557 
(2) INFORMATION FOR SEQ ID NO: 447: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 



20 


GCTAGAAATm 


TTGmATGaCA 


ATACAACTCT 


GTTAAAaTGA 


TGGACGTAGA CAAATATGCG 






TATTGACGCT 


TTATTTTAAA 


AATTaACATG 


CTTATAACAT 


GTTTATAGAA GGAGATTAAC 






CTATGAACTA 


TCaAGTTCTT 


TTATATTATA 


AATATATGAC 


GATTGATGAC CtGAACAGTT 






TGCTCAGGAT 


CACTTAGCCT 


TTTGTAAAGC 


ACACCATTTA 


AAAGGTAGAA TTCTTGTTTC 






TACAGAAGGT ATTAACGGCA 


CATTATCTGG 


TACAAAAGAA 


GAAACCGAAC AATATATGGC 






ACATATGCAT 


GCCGATGAAC 


GATTCAAAGA 


TATGGTGTTT 


AAAATTGATG AAGCTGAAGG 


360 


30 


ACATGCTTTT AAGAAAATGC 


ATGTACGTCC 


TCGAAAAGAA 


ATCGTTGCTT TAGATTTAGA 


420 




AGATGACGTC 


GATCCAAGAC 


ACACAACTGG 


CCAATATTTA 


TCACCTGTAG AATTTAGAAA 


480 




AGCTCTTGAA GATGATGACA 


CAGTCATTAT 


TGATGCACGT 


AATGATTATG AATTTGATTT 


540 


35 


AGGTCATTTC 


CGAGGTGCAA 


TTCGT CCAAA TATCACACGT 


TTTAGAGATT TGCCTGACTG 


600 




GATTAAAGAG 


AATAAAGCGT 


TATTTGCAGA 


TAAAAAAGTG 


GTTAOGTACT GTACTGGTGG 


660 


40 


CATTCGATGC 


GAAAAATTTT 


CTGGATGGCT 


TTTAAAAGAA 


GGTTTCGAAG ATGTAGCTCA 


720 


ACTTCATGGC 


GGTATTGCTA 


CATATGGTAA 


AGATCCTGAA 


ACAAAAGGTG AATATTGGGA 


780 




CGGTAAAATG 


TACGTATTTG 


ATGACCGTAT 


CAGTGTTGAT 


ATCAACCAAG TTGAAAAnAC 


840 


45 


AATTATTGGT 


AAGGATTGGT 


TTGATGGCAA 


ACCATGTGAA 


CGTTATATTA ATTGCGCTAA 


900 




CCCAGAATGT 


AATAAACAAA 


TATTAGTTTC 


TGAAGAAAAC 


GAAACTAAAT ATTTAGGTGC 


960 




ATGCTCTTAT 


GAATGTGCTA 


AACATGAGCG 


TAATCGTTAT 


GTTCAAGCAA ATAATATTAG 


1020 


SO 


TGATAATGAG 


TGGCAACAAC 


GTTTAACAAA 


CTTTGATGAT 


TTACATCAAC ATGC TTAGTT 


1080 




TTAATTAAAT 


ACCTTTCAAA 


ACACGCTTTG 


AAAATCCGAT 


TTATAAAGGT TTTTCAAGGC 


1140 



55 
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TAAATTTTAA 


TACTGCGGGG 


TGTCTTAAAA 


TGCACATTTT 


AGTAACAGGG 


TTTGCGCCTT 


1260 




TTGACAATCA 


AAATATCAAT 


CCCTCATGGG 


AAGCTGTGAC 


TCAACTAGAA 


GATATTATTG 


1320 


5 


GCACACATAC 


AATCGATAAA 


TTAAAACTAC 


CAACCTCTTT 


TAAGAAAGTA 


GATAATATTA 


1380 




TAAATAAAAC 


GTTGGCATCT 


AATCATTATG 


ATGTTGTACT 


AGCTATAGGA 


CAAGCTGGTG 


1440 




GTAGAAATGC 


CATTACCCCA GAACGTGTCG 


CCATTAATAT 


TGATGATGCA 


CGTATTCCAG 


1500 


10 


ATAATGATGA 


TTTTCAACCT ATTGATCAAG 


CCATTCACTT 


AGACGGTGCG 


C CAG CTTATT 


1560 




TTTCAAATTT 


ACCaGTTAAA 


GCAATGACTC 


AAAGTATTAT 


TAATCAAGGA 


CTTCCTGGAG 


1620 


15 


CACTTTCAAA 


TAGCGCAGGT ACATTTGTTT 


GTAATCACAC 


ACTTTATcAC 


TTAGGTTATT 


1680 


TACAAGATAA 


GCATTACCCT 


CACCTACGAT 


TCGGATTTAT 


TCaTGTGCCA 


TACATACCAG 


1740 




AGCAGGTcAT 


TGGTAAACCC 


GATACACCAT 


CTCATGnCCA 


TTGAGGAAAA 


GATnAGTTG 


1799 


20 
25 


(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 










(xi) ! 


SEQUENCE DESCRIPTION: £ 


SEQ ID NO: 448 : 






30 


ACTTGGTTTT 


TTATTGTTTA 


TAAATAAAAC 


TCACTTAATA 


ATGTTTTCAT 


AATCTTCTTC 


60 




GACTACTTAA 


TTCTTTAAGA 


TATTCGTGAA 


AAGAGACATT 


ACACTAGTTA 


ATTTTCAAAC 


120 




AATACAAAAA 


GCGTCTACCT 


CCTACATATA ATTGTAGCGG 


AGATAGACGC 


TTAATATTTA 


180 


35 


TTTAAAAATT 


ATTTTAAACC ACCGAATGTC 


ATAACATCAC 


GGGCAATCAT 


ACTTTCTTCA 


240 




TCTGTTGGAA 


TAACGACAAC 


TTTAACTGGT 


GAATGAGGAT 


AGTTAATAAA 


TCCTTCTTTA 


300 


40 


CCACGTAGTA 


AGTTTTCATT 


TTTCTTAGGA 


TCCCAGTAAA 


CACCCATAAA 


TTCTAAGCCT 


360 




TCAAGAACTT 


TCGCACGAAT 


TTCTACTGAG 


TTTTCACCGA 


TACCTGCTGT 


AAATACGATA 


420 




ACATCAACAC 


CATGCATTCT 


CG CAG CAT AT 


GATCCAATAT 


ATTTGTGAAT 


TTTAGAAGCA 


480 


45 


AATACATCTA 


AAGCCATTTG 


TGAACGTGCT 


TTACCTGATT 


CAGCTTCTTC 


TGATAAGTCA 


540 




CGTAAATCAC 


TAGATGTACC 


TGATAATCCT 


AATAAAC CTG 


ATTCTTTGTT 


TAAGATTTC C 


600 




AATACTTGTT 


CAGCAGTTTT 


AC CTGTTTTT 


TCCATAATAA 


ATGGAATTAA 


AGCAGGGTCA 


660 


50 


ATATTACCAG 


AACGAGTACC 


CATTGTTACA 


CCAGCAAGTG 


GTGTGAAGCC 


aTTGATGTAT 


720 




CAATAGATTT 


ACCGCCATCG 


ATAGCTGCAA 


TTGATGCTCC 


ATT AC CAATG 


TGACATGAAA 


780 



ss 
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TATGGCTTGT ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATG 90 0 

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTGATG GAATGCTGTA TCAAAAATTG 960 

5 CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 102 0 

GGTTGTGaAG CGGTGCTAAT TcGCTTAATT CTTCAATTTC CTTTTCAACC TCAT CAGTAA 1080 

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT GTTCCATCGA 114 0 

w 

TATCGTTAAT ATCATTAATA ATATTG TGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 120 0 

CCTCAACGTG ATCCTTGATA TCTTG TACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 126 0 

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 132 0 

15 

CAGGCATTCT AATTAATTGA A 1341 

<2) INFORMATION FOR SEQ ID NO: 44 9: 

2Q (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 





TTTTGAAGAT 


ACTACCGATG 


AAAATAGACA 


AAAGATTTTT 


CAATATTTAT CACCTGAAgA 


60 


30 


AGTTGCAAAT 


TTCTTTGATC 


AATTAGATAT 


TGATGACGAT 


GAATATGAGT 


TGCTATTTGA 


120 




TAAGATGAAT 


GCGACATACG 


CAAGTCACAT 


ATTAGAAGAA 


ATGTCATACG 


A CAATGCAGT 


180 




AGATATTTTA 


AATGAGTTGA 


CTAAACCAAA 


AGTTGCTAGT 


CTTTTAACAT 


TGATGAATAA 


240 


35 


AGATGACGCG 


AATGAAATCA 


AAGCATTACT 


TCACTATGAT 


GAGGATACGG 


CCGGCGGTAT 


300 




TATGACGACG 


GAgTATTTAT CACTTAAAGC GCATACGCCT 


GTTAAAGAAG 


CATTATTATT 


360 


40 


GGTCAAAGCG 


CAaGCACCAG 


ACGCAGAAAC 


AATATATGTT 


ATATTTGTCG 


TTGATGATGA 


420 


TGGTAAATTA 


GTAGGTGTTT 


TATCG CTAAG 


AGATTTAATT 


GT AG CTGAAA 


ATGATGCTTA 


480 




TATTGAAGAT 


ATTATGAATG 


AACGTGT CAT 


TAGTGTGAAT 


GTAGCAGACG 


ACCAAGAAGA 


540 




TGTTGCTCAA 


GTTATGAGAG 


ACTATGATTT 


CATGGCTGTA 


CCTGTTATAG 


ATTACCAAGA 


600 




ACATTTGCTT 


GGTATCATCA 


CGATTGATGA 


TATTTTAGAC 


GTTATGGATG 


AAGAGGCTAG 


660 




TGAAGACTAC 


TCTCGTTTAG 


CCGGGGTATC 


AGATATCGAT 


TCGACTAATG 


ATTCAATCAT 


720 


50 


TAAAACAGCA 


TTAAAACGTT 


TACCATGGTT GATTATTTTA 


ACATTTTTAG GAATGATTAC 


780 




TGCGACAATT 


TTAGGGAGAT 


TCGAAAAAAC 


ATTAGAAAAT 


GTAGCGCTAC 


TCGCAGCGTT 


840 
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TCGTAACATT ACGAcAGGGG AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 9 60 

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 10 20 

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 1080 

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TC CATTATTG ATGAATAAAT TAAATATCGA 114 0 

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200 

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 12 6 0 

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 13 20 

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG T GATTGG AAA 13 80 

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 144 0 

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500 

ACGCGCACGT CAAGGACAAA ATGATGATG 152 9 
(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 

TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 6 0 

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120 

GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 180 

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 24 0 

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 300 

CACTCAATAA AATTAAAGTT ATT ATTC CAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360 

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTG AT C ATGTGATTTT TCTGTTTTCA 420 

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 4 80 

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 54 0 

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600 

CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 66 0 

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 72 0 
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CAGCAATATC TATTTGATTC ATTGCTGCAA CAACCTGTTC GTAATGAAAT ATATCACACT 84 0 

GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAG CAA 9 00 

AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATA CTT ACCAATATAT CCTGTTCCAC 9S0 

CTGCTAATAA AACTTTAGGT TTCAT CTAAA ATACTCCTTT AAACTGTAAC CAAAAAACAT 102 0 

ATTACTCCAC CTTTTAG TTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 108 0 

AATACTGGAG GCTTTTTATG GCCCATATTA TACG TAG AGT TAGTATCAAA GATGTAGAAA 114 0 

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 12 0 0 

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 1260 

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTGGCTTT GCCTTTGTTA 1320 

ATACGACACC ATTTCAACX3C ATTAAACATG TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 13 8 0 

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 144 0 

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAAC CAACCT GCCCTCGAGC 1500 

TTTTTAAAAG TGCCGATTTT CAAATCGAAG GCGTTTTAAA AGATAAGTTA TTTATCGATG 1560 

GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 162 0 

TCTTGTATCA GAATCG TATA ACAACGAATT TAATGGTTAC CTAATACATT ACT CATACTT 16 80 

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 174 0 

TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTACGATT AGTGGTGCTT 180 0 

TATATATGTG G TTAGTTTTC CTnACTA 1827 
(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 51: 

ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

TGACGTTATT CTATGkTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 

TTATTAAAAA G TTGAAAT AT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 

TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACG CCAAATG ATACGGCGTT CAGATTACAT 24 0 

TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 300 
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AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 42 0 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 48 0 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 54 0 

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 60 0 

TCACtGAATT CACCAA 616 



w 

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 944 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 



GCACGAGTGA 


TTAAACGGTT 


AATCAATGAA 


ACATTTGATG 


CAAATTACAT 


TGAAGTTATT 


60 


GAGGGAGGAA 


TTGAAGAAAC 


GCAAACGTTA 


ATTCACTTAC 


CTTTTGACTA 


TGTCTTCTTT 


120 


ACAGGAAGTG 


nAAATGTAGG 


CAAAATCGTT 


TATCAAG CTG 


CCAGCGAAAA 


TTTAGTTCCT 


1B0 


GTGACATTAG 


AAATGGGCgG 


TAAAt CTCCa 


GTCaTCGtTG 


ATGAAACAGC 


GAATATTAAA 


240 


gTTGCTAGTG 


AGCGCATTTG 


TTTTGGGAAA 


TT CACTAATG 


CCgGCCaAAC ATGTGTTGCa 


300 


CCAGATTACA 


TTTTAGTACA 


CGAATCTGTA 


AAAGATGATT 


TAATCACAGC 


CCTATCAAAA 


360 


ACGTTGCGTG 


AATTTTATGG 


TCAAAATATA 


CAACAAAGTC 


CAGATTATGG 


CCGCATTGTA 


420 


AACCTTAAAC 


ATTATCATCG 


TCTGACTTCA 


TTACTTAACA 


GTGCACAAAT 


GAATATTGTA 


480 


TTTGGTGGTC 


ATAGTGATGA 


GGATGAACGT 


TATATAGAAC 


CAACATTGTT 


AGATCACGTT 


540 


ACAAGTGATT 


CAGCAATTAT 


GCAAGAAGAA 


ATTTTTGGTC 


CTATCTTACC 


GATTTTAACG 


600 


TATCAGTCAT 


TGGATGAAGC 


AATAGCCTTT 


ATTCACCAAA 


GACCAAAACC 


TTTGAGTTTA 


660 


TATTTATTTA 


GCGAAGATGA 


AAATGCTACA 


CAACGTGTAA 


TAAACGAG CT 


ATCATTTGGC 


720 


GGCGGCGCTA 


TTAATGATAC 


ATTGATG CAC 


CTAGCGAATC 


CTAAATTACC 


ATTTGGTGGT 


780 


GTTGGTGCCT 


CAGGTATGGG 


ACGCTATCAT 


GGTAAATATT 


CATTCGACAC 


TTTTACACAT 


840 


GAAAAAAGCT 


ACATTTTCAA 


ATCnACACGA 


TTAGAATCAG 


GTGTCCATTT 


ACCACCATAT 


900 


AAAGGTAAAT 


TTAAATrCAT 


CAAAGCTTtC 


tTTAAAnATT 


AATT 




944 



50 (2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4B20 base pairs 

55 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

s 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 

TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 6 0 

w TTTGTTTCAA CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 120 

TTG T ATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180 

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 24 0 

15 TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAATGTAT 3 00 

ACAATGGTCC CCCGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 3 60 

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 4 20 

20 ATATTCACGA ATAACATTCG CTAATGCCGC TTCTCTTCCA TTATTTT CTT TAGACATCCT 4 80 

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 54 0 

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 6 00 

25 

CATGTTTTTT CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 660 

TGACAATTTT ACGGCTTGAA AATCAATTTA ATCATGGAAA ATTTATAATA TTCATTGTTT 72 0 

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAGTAAACAA 78 0 

30 

ATAAGGGGTA GATAAATATG AG TG AAATC A AACGTCTTGA AATTAATTAC AAAACTGACG 84 0 

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 9 00 

35 ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 96 0 

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020 

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 10 80 

40 GTACGCTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 114 0 

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 12 00 

TATAGAGGGA CATGATGTCT TATTGTGGAA ACCATAAGAT AATAATATTC GACACTACGA 1260 

45 GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 13 20 

GTaAATCATT TTCAAATTAT TTG T CTT AAC GTACAATATC ATTTAGTTGT TTCCATGrAT 13 80 

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGAT C AAA TCCTCTAAAa TTAAACCAGC 144 0 

50 AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 150 0 

TAATCGAGTG CTGCCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 156 0 
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ACGGCGTCTG 


ATTTAACCTT 


CTCTTTTGCG 




ACCATTAGTA 


ACAATATATA 






AATCATGTCT 


TTTCGATAAT 


TCGACAATTG 










£ 


TAGCTTCTG C 


TAATCCATTT 


CTAAATAACA 


CATCTGCACG 


ATGCCCATCA 








GATGATGTTT 
TTTGkTGAAA 


GAAGTAATTC 


ACAAATCGTT 


CTGATAATAC 


TTCAGACTTC 


G k TAATTT AT 




10 


AGCTTCCCAA 


TGTTGGTGaT 




AAATGkTAAA 






TTGTTGCTTT 


ATGATTAAAA 


ACATTCGCCA 


TATAGTGAAn 










AAAAATCAAC 


AATTGTATCA 


TCAAAGTCTA 


T CAAAATATT 




AATTTCCCCA 




15 




TGTCTATGTA 




AACAGAGGCT 




CAAATAAAAT 




AAACTGAGTG 


CATAACATTA 


AAGTATG CTC 






T ATT AG TT AT 






TATATTAGAA 


TCCAAATAAT 




AACCCACACC 


GTTAGCAACG 


ATGTCTACGA 




20 


TACTTGTGCC 


TAATTTCACA 


CTATCATGTT 














TCCAGTCATT 


TAAATCTCTC 


CCTCACCTTT 




TGATTACTTA 






CATAACATAT 


TGAAATTAGA 


ATCCGAATAA 


TTTACCTAAT 










GATGCTCACA 




CTAATTTTGC 


GCCATCATTA 












GCGTTAAATA 


AACCTTCCAT 


TGAAAACACT 




TAAATTTGAA 






GATAACAAAA 


ACGTGCGTAg 


yTTTTAAATC ACCGAAATGT 












TAAAATAAAT 


TTGATG CAAT 


TAGTTTGTTT 


ATCCGCACAA 








TGTACTTAAC 




AGAGAAAAGA 


AATACAGTTA 




ACTGTATTTA 






ACACAATTAA 


GTTGCCTGAA 


TTCGTATTTA 


AGTCTTATTG 




GATAAATAGC 




35 


TCTATAATAG 


TGAAAAATAT 


AAACATTTTT 


TATTTACAAG 


GTATTGCTAA 








TTT AG AT ATA 




TGTTGTTAAA 


CGTGTCCTGG 


TAGCTCAGCT 


GGATAGAGCA 






ATGGCCTTCT 


AAGCCATCGG 


TCGGGGGTTC 


GAATCCCTCC 


CAGGACGTTT 






40 


TTATACGCAT 


TACCAAACAA 


AAGAGTTCCG 


TGATTACGGG 










CAGTAATATA 


GTATGATGCG 


TCACCAAAAC 


GTCCCCCGCA 


TAAGCCCCGA 






45 


ATTAAAACAA 


GCATGCTTAT 


TCGTTATAGA 


ATTTTTTGAC 


ACACAATTGA 


CACGCGTCTG 


3120 


ACACTTGTTT 


ATACATTTTT 


AATTAAGTAA 


TTTTGTGCTC 


AAATTTCATC 


TATACTGCAC 


3180 




CTGAACTACA 


CCAACACTAC 


AC CAAG ATTT 


TTAACACTCA 


CCATTTGCAT 


GCGTAGAGAT 


3240 


50 


TTTTATTATT 


ATATTATTCC 


TATAGATTTT 


GATACTATTC 


AAAATTTTAG 


GGACTTTTCA 


3300 




GGGGCCCGAA 


ATCCTATAAT 


TATAATTATA 


TACATCTAAA 


AAAAATAACC 


ACGTCCATCG 


3360 



55 
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ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 34 8 0 

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3 54 0 

TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3 600 

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3 66 0 

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 372 0 

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 378 0 

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAAT CAT 3 84 0 

GAGACTTCAA AAAG CACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3 900 

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3 96 0 

GATTTGGATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4 02 0 

AAACAGTCCT CTAG TTTCAT CAGTGTT CAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 408 0 

TACTTATTTA AATACATTTT TTAGTAACAT AACTTTTATC AATAGCTTAA TAACGCCTAT 414 0 

TTGGATTTTA TGCCTTGTGG GAATTATGAC GCATATGATT ATTTTTTCAA TAAAATATTT 420 0 

AAAAGATTTT TCACTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 426 0 

TATCGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4 320 

ATATGGCTTT GTAG CTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 438 0 

TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 4 44 0 

CGCTGCAGCA TATGTTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4 50 0 

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4 56 0 

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTGAG CAACTGCTTT 4 62 0 

AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGtcCTTT TGTTTATCGA 4 680 

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 474 0 

AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4 8 00 

ATTAACACGT ATATTTAATA 4 82 0 
(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTK: 4 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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ATTTGGATCT TTAATATCAC CAATATTTTT AATATCTTCC GGATTCAATC CATATACTTG 60 

TACTGTATCT GAGTATTTAA TTGTGAAATA ATCACCTGAT TTAACTTTGT CATCAACTGT 120 

5 AATTTGTGAT TTTAATGATA AATAATC TTG GGCTGGTACG ATTTTATTGT TTTTATCTGC 180 

ATCAACGACA GTtAATGTTG TATTTGATGT GATTAAATCA TTAACATTTT TAGCCTCTGT 240 

TGATGATGGC TGTACTGCTG CTATACG CAT TCTTGTATTC AAACGTTTAG GTGCTGTACT 300 

10 

TTTTGGCAAA ATGATATCTG CATTATTTTC ATTATTTGAA TTACTATTGT TATCAACAAG 360 

AGTTTCATCA TTACTCTTGA TAGCATCACT TTTAACATTT AATGTAGTTG ATTCAGTTTT 4 20 

GGCATCTACC TTTTTGTTTT CCTCATTAGT TGGTTGAACA TTTACCACTG ATTTATTCTC 4 80 

15 

TTGCAAATCA GGTTGTAACG CTTCTTGATT ACTTATAGTT TGTTTAGTGT TTAAATCTTC 54 0 

ATTCGTAGAT TTTGGTGAAG CTTGCTCATC TGATTTGGCA GTTGAAACTT CAACTTTATT 600 

20 TCCAGTGGTA GATTGTACAC TTTCTTTTTC TATTAATTTA TTCCCATTTG AAGTCGTTTC 660 

ATTACCTTGa GATGATACCA TTTCTTTTTG ATTAT CATTT TTAGTATTGT CTTCTTGATT 720 

TAGTTGCTGC ATATCAACTT TATCACTCGA TTGATTATCA CTTGCTGAAG TTGTCGCTcG 780 

25 TTCAATT CTT TATTAGTACT TTCTGCAGCC TTTGCTTCTT GGTTCCCCAG ACCAAAAATT 84 0 

AATGTTGTAC CTACTAAAAT TGATGCTGTT CCCACTGTGT ACTTTCTAAT CGAAAATTTA 900 

TTTAATCGAT TGGATACCAT GCCTTTCCTT GTTATTGCCG TTTTATTTTC TCTGTTTAGC 960 

30 ATTAGATTAC TCCTAATTCA TCAAATTTTT AAATAATACA ATTGTTTTAA ATACAAAAAT 1020 

GTATAT CAAT ATAGTATTAC ATTTTTAGAT AAAGCACAAT ACTTTAATTA TTTTTCTTTA 1080 

TCGTAAAACG TTATTTAACA TTTGTGTTTA AATAAAAGTT TTTATGAGTt TTGTAATCTT 1140 

35 

TATTTAATCA TCATAAAAAA TAGTATTATT TGCCCTTGAA ATTAATATCT TAGCTTTTCT 1200 

AATTCATAGA CAATTACATT T CTGTAACAA ATTAAATTGT ATCTATTCCT TAAAGATTTT 1260 

4Q TTGTTTTATA TCTGGGAATT T CT AAACAG A AAAAACCAGG CCACATGGAC CTGGTTAAGT 1320 

T AAT CAT ATT ATTTATTTTG TTTTTTACGA CGACCGAATA ACAATAATGA TCCTAATGCC 1380 

GCGAATAATC CACCGAATAA TGTGCCATTA TTTGAATTAT TATTTTCACT ACCTGTTTCT 144 0 

45 GGTAATGCTT TAGCTGTTTT ATG CTGATCT TTAACCGTAC TCATTGGTTT AGCCGGAGTA 1500 

TGTTTACCTG CATCTGAATC TGAATCGCTA TCTGAATCTG AGTCGTTGTC TGAGTCCGAA 1560 

TCGCTATCTG AATCTGAGTC GCTGTCTGAA TCTGAATCGC TATCCGAGTC TGAGTCGCTA 1620 

50 TCTGAGTCTG AGTCGCTATC TGAATCTGAA TCGCTGTCTG AGTCTGAATC GCTATCTGAG 16 80 

TCTGAATCGC TGTCCGAATC TGAGTCGCTA TCTGAATCTG AATCGCTATC TGAATCTGAG 174 0 
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TCTGAATCTG AGTCGCTGTC TGAATCTGAA 
TCTGAATCGC TGTCAGAATC TGAGTCGCTA 
TCCGAATCGC TATCTGAATC TGAATCGCTA 
TCG CTATCTG AGTCTGAGTC GCTATCCGAG 
TCTGAATCTG AATCGCTATC TGAGTCTGAG 

W 

TCGCTATCTG AATCTGAGTC GCTATCTGAG 
TCTGATGTTT CTTCTTCGTA GTAGCCATTA 

;s GTTACATCAA CTTCGCCACC ATCGGCATCT 

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA 
AAGCGGTATT TACCATTTTC ATCTGTyTCA 

20 TGCAAAGTAA CTTTAACACC TTTAATTCCT 

TTACTGTCGT ACCAAACATA ATCACCTAAA 
TCTAATGTCA TGTTGTCAGC GTCTTTAATG 

25 ACAGCATCAT CTGTACCTAC ATTTGCAGTT 

TCTACACTAT AAGTT CCATT GCTTAAACCA 
GTCGTACGAT CTAATTCTTT ACCGTTACTA 

30 

CCTTTTTCAT TGG CATCTTG TTTACCATCT 
AGATTATATT TCTTTTGGTC GCCATTAGCA 
TTTGACCAAC TATATTTAGT TTTGTCAGTG 

35 

GTTGAACTAT TATCTGGATA AGCAACTTGT 
TGGCCTTTCA TTAAATCGAC TGTAGCTGTT 

40 TGATCAGTAA CATCTTTAAG TTTTGAAGTA 

TGATCTGtCA CTtCGTAAAT TTTGAAGTTT 
AAATTAGTAA CAAACGTTTG TTTAGTATAT 

45 ATATTACGCG ATAAATCTTC ATTGTTAATA 

TTTTTATTAC CATAATCGAC AATGATTTCT 
TCCATTTTAT aagCTGTTTT ATCAGTTGTT 

50 

TCAAAGCTAC CTCTAACATT TGTATATTGA 
GTGTTTGTTG TACTATCATA AATACCTTTT 

55 



TCACTGTCTG AGTCTGAGTC GCTGTCTGAG 186 0 

TCTGAGTCTG AATCTGAATC ACTGTCTGAG 192 0 

TCTGAGTCTG AGTCGCTATC CGAATCTGAG 198 0 

TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 2 04 0 

TCGCTGTCTG AATCGCTGTC TGAATCTGAG 210 0 

TCTGAATCGC TGTCAGAATC TGAGTCGCTA 2160 

TCAAGTGTGA AAT CAT CATG ATCCGTAATT 222 0 

TTATCATCTT CAGTTGTATT TGTACCTGTT 22 8 0 

AC TTTGT ATT TACCACTATC TAAATTATCA 2 34 0 

GTTGTACCAA TTACTTCGCC TTTTTCGTTT 24 0 0 

TTTTCAGTCG AATCTTGTTT ACCATCTTTA 24 6 0 

CTATATTTTG GTGTTTTGTA GAATCCACTA 2 52 0 

ACACCTGTTG TAGTTAGTCC ATCAGAATCT 258 0 

GTCGGTG TAT AACCGG CTGG TGTTGAAAAC 2 64 0 

GTGAACTGAT ATTTACCATT TTCATCTGTT 27 0 0 

TCTTTAAGAA TGACATAAAC ACCTTTAATC 2760 

TTATTTGTAT CTTCCCATAC ATAGTCACCT 282 0 

GTTGATGAGC CATTCACATT TGAATAACTA 2880 

TCTAAAGTAT AATCAATTTT TCCATTATCT 2 94 0 

TGAATGATGT ATTGTTTATT GCTGCTTGTT 3 000 

TTATTATCAT TACTATAAAT AACATCGAAT 306 0 

TCAGGGGTGA AACTATCCAC AAATTGATTT 312 0 

TTTGCATTTG GATTAAATTT ATATC CAGTT 3180 

GTATTTTTAG GTTGATTTAC ATATGCAGTC 3 24 0 

TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 33 00 

TCGCTATATG TATCATTACC TAAAGTTACT 3 3 60 

GCATTTTTAC GTTTCG CAAA TGCAACTTGT 34 20 

TCTACATAGT TCGTAAAAGT AT ATGTTG TT 3 4 80 

GCAATAATAT TACCTTGGGC ATTATATAAA 3 54 0 
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GTAAATGTAT CGCCCTCTTT AACAGAATCA TCGATTGTGT AATTTGC TTT T AATTT T AAA 3 660 

ACATCACTTG AAGTTGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 3 72 0 

TCAATCGCAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3 78 0 

GGAGCTGCAA CAGTATT CAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCGTTGTT 3 84 0 

TTAGGTGTAG TTGAAACATC TTTTGCTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3 900 

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3 960 

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4 020 

CTCATTGTCA CTTTAGGCTG ATCTG CAGTT GCAGTTTGCG TATTGTCTTT TAGTTGACGA 408 0 

CTATCAA CTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTG TCGTTTCATT TTTTGATTGA 414 0 

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 42 0 0 

AATGTTG TCC CTACTAAAAT TGAAG CAGT A CCTACAGAAT ACTTTCTTAT CGAAAATTTG 426 0 

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTnGCTG TCTTTTTATA ATTCATTTAA 432 0 

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4 35B 
(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 
TTGACTTCTT AATTCAGCAT TTTCTG CACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60 
TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120 

ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA AAAGACATTT TTTCACCAAT 180 

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 24 0 

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 300 

GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360 

AATTT ATGTT ATAACGGTCA TTTAAATGAC AGAATTAGGT CACTCATAGT ATTTTGAAGA 420 

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480 

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 54 0 

TCATCGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600 
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AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 72 0 

AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 780 

5 ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 84 0 

AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA S00 

TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 960 

10 TAAcGrGCAT AGTG CATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 1020 

TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 1060 



(2) INFORMATION FOR SEQ ID NO: 456: 

IS 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 56: 

25 ATAATnACTA AATACnAAAG TTTAACT GTC TTACTAATAA TGACTATGTT ATAATTTTAA 60 

AAG TGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 120 

AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGG CATTAAT TTGACGGCAA 180 

30 TGAAATATCC TAAGTCTTTC GATATGGATA GAG TAATTTG AAAGTGCCAC AGTGACGTAG 24 0 

CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACC CCT CGAGTGAGCA ATCCAAATTT 300 

GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 3 60 

35 GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 4 20 

GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 4 80 

AGAGCTTTTT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 540 

40 

GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 6 00 

TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 660 

4S CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 720 

TTAGG CTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 780 

ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 84 0 

50 TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 900 

ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 9 60 
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 1080 
TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 1140 
TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 1200 
CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 1260 

1262 

(2) INFORMATION FOR SEQ ID NO: 457: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 2 base pairs 

(B) TYPE: nucleic acid 

'5 (C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

CCTGGCTGCT TTATCAG CAT CTACTACTTT AAAACCGAAT ACGGATAAGA GTTCTGATAC 6 0 

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTT CACT 120 

25 CTTTCTTTAT TTTTGACATA CTGGACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180 

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 24 0 

CATCTCACCA GTTTTT C CAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGCCATATTT 300 

30 AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 360 

TAAGTCTTTT ACTTTTTTAT CTGGTAAAAC ACCTGCACGA AACAACGCTT CA CATGCGT A 42 0 

AATATTTCCA CAACCTG CGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 480 

35 

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 54 0 

TTCAGGCGCT ATTTCTAAAA ATGAA GGAT A AGATGCTACA GACGCAACAT TTCTAATTTC 60 0 

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 660 

40 

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 72 0 

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 78 0 

4S AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 84 0 

TTTAAAAGTA TCTAATTCAA TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 90 0 

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 96 0 

50 TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 1020 

TGCATCATAC CAGGTTGCAC CAT AA CTTG A GTCTACTTTT AATGGAACAT CTArTTGCAA 10 80 
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TT 1142 
(2) INFORMATION FOR SEQ ID NO: 458: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

w 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 

CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 60 

is 

TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGTT AGGCGTGTTG 120 

GTAAACATGT GTATATCACT GTTTAAACGG TCTTTATAAG CTTCGGTACC TTGTACAT CG 180 

20 TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TTTCTCTACC TTCGCTATCA 24 0 

GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 300 

GGATCTAAAT TTAAATTACC TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAG 36 0 

25 TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AAGTGATTAC TTTCTCATAA 42 0 

TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 480 

GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 54 0 

30 CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 600 

TCTATTGG TT TAGTTCTTAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 660 

TCGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 72 0 

TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 7 B0 

TTGTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACAT CTTTG 84 0 

TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC CGTTGATAAA ATCGCTAATA 900 

40 

TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT ACTCTTCTTT GCGTCGTGTT 960 

AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 102 0 

45 CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 1080 

TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 114 0 

ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 120 0 

SO CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTt CTAAC 126 0 

CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 132 0 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 144 0 

TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 150 0 

5 ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 156 0 

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 162 0 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAG CACT 1680 

10 AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 174 0 

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 180 0 

ACCTAACGCA TAAA 1814 
(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTT CAC AATTTTTTAT TACCTATTTA 60 

CTGGTTTCAT GTCTTATGGG G CATTTT ACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120 

TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATG CATTCAA 180 

TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 240 

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 3 00 

AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 36 0 

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 420 

ATT AG CG AAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 480 

TGGCCGGTTG CAGGAAG TTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 54 0 

TTTAATGTTC AGTTTGAAAA ACAACT AG CA ATCACTGTAT TTTTTATCAT GTTAATAGGG 600 

ATGTTAATTT TTTATTTTTA TCT AAA TAAA AAATTAACAT TAAAAATTTT TAATACCAAC 66 0 

GTGGGTAATA AGAATAGGAG TTGTAT 686 
(2) INFORMATION FOR SEQ ID NO : 4 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

5 ATCTGCAATT ATGGGCACAC CCAAGCTnAT GCAAGTAACT AAAGGAGAAG TACTTTTAGA 6 0 

CGGTGTAAAT ATTTTAGAaT TAGAAGTTGA TGAAAGAGCA AAAGCAGGAT TATTCTTGGC 12 0 

AATGCAATAT CCATCAGAAA TTACAGGTGT TACAAATGCT GATTTCATG C GTTCAGCAAT ISO 

CAATGCGAAA CGTGAAGAAG GACAAGAAAT CAACTTAATG CAATTTATTA AGAAATTAGA 24 0 

TAAAAACATG GATTTTCTAG ACATAGATAA AGACATGGCA CAACGTTATT TAAATGAAGG 3 00 

TTTCTCAGGT GGAGAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAACCTAA 360 

75 

GTTTGCaATC TTAGATGAAA TCGATTCAGG GTTAGACATC GATGCATTAA AAGTTGTATC 4 20 

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TTAATGATTA CACACTATCA 480 

20 ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 54 0 

TAAATCTGGT GGTCCAGAAT TAGCAAAACG TCTTGAAGAA GAAGGATATG AATGGGTTAA 60 0 

AGAAGAGTTC GGTTCAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 6 60 

25 TATATGATGA AAATGAACAT TTATACGAAA TAG T AAATTT CATCAAGTAG GAGGAAAAAG 72 0 

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TGTTGATTAT TCTAAAGCCC 780 

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 84 0 

30 TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 900 

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 9 60 

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 1020 

35 

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAGGT TTAGCAGACG 10 80 

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 114 0 

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGCGTA TTTGTTTATG 12 00 

40 

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 12 60 

ATGCAAGCTT TTATAACCAT GTTATCATCG TTACTGAAGA 13 00 

45 (2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 

tA) LENGTH; 3135 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
50 (D) TOPOLOGY: linear 



55 



1376 



EP 0 786 519 A2 

GACAGCAAAT TCAAGATACA TTAAATAAAG ATATTGTCAT AAAG CAT ATT CTTGTTCGAG 6 0 

ATAAAT CTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 12 0 

5 AAATTTTAAA T GATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180 

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 24 0 

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300 

10 

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 3 60 

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 420 

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 4 80 

75 

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 54 0 

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 600 

zo GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CG AT CAACTG GGGTATAAAA 6 60 

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720 

CGTTAATTGA TAAAAAGCAT CAATT AG CAG CTGTAGAGGA TGAATATAAC GCGATTTATG 7 BO 

25 TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 84 0 

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900 

ACACATTGCC ACCACATTTT GAATT AAAG A CAGATAAAAC ACGGGAAATG ATGGATT CAG 960 

30 ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020 

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTCAC CGATCATTAA 1080 

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAC 114 0 

35 

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200 

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 126 0 

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320 

40 

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380 

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 1440 

45 AAATTGTAAT ATG CGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500 

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 1560 

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 1620 

50 TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 168 0 

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 1740 
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AAGGCTTTAA AGAATATCAT GAAGCTAAAG GATCACAATT GCCGAAAATG TTTGGCTTCC 18 60 

AAGCTGAAGG CGCATCACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 1920 

5 CAACTGCTAT TCGAATTGGT AATCCTGCTA GTTGGGATAA GGCGACTAAT GCTCTTAAAG 19 BO 

AATCAAATGG ATTAATAGAT AGTGTTACTG ATGATGAAAT TCTAGAAGCA TATCAGTTAA 2 040 

TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT GCAGGTTTAA 2100 

w 

TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTG CT ATTTTAACTG 2160 

GTAATGGATT AAAAGATCCT G ATACTG CT A TTT CACTACT AGATAATCCG ATAAAGCCAT 2220 

TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AGCTTTATAA CATGTCGAAT 22 8 0 

15 

GTTTTGGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TGATTCTATA 234 0 

GGTATGGCTT TAGATAAATT TTTGCATCTG TCTGTAAAGG AAACATCAGG GACAAAATGG 24 0 0 

20 GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAACTTTATT 24 60 

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 2520 

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CGTCAGCTTC TGCTTTAGTA 2580 

25 GGAGCTATAT ATATCG CAAA TTATTTTGGT GATATCCAAC TGTCTAAACA TGAGGTATTA 2 64 0 

CAATTAGCGA CTGAAATCGA AGGACATCCT GATAATGTTG CGCCGACCAT TTATGGTGGT 27 00 

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAACGT CaGTtGCACA TATCGACATA 27 60 

30 CCAGACGTGG ATG TGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2 820 

CGTGCTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2 880 

ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 2 94 0 

35 

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTGAAAACA 3 000 

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAGTG GTGCTGGACC AACTATTTTA 3 060 

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 3120 

40 

TCATGCCATT CTGAA 3135 
(2) INFORMATION FOR SEQ ID NO: 4 62: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

55 
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AGnTCAATAT TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 120 

TAAAGGTACA GGTGCACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180 

TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCG TTG CAACACT 24 0 

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAGCAACAAG 300 

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 3 60 

AAGCATG TT A TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 4 20 

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 4 80 

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCGCTCCG GTATTAGAAG 54 0 

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 6 00 

CATCAACTGC AG CT AAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 660 

GTCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAG CT GAAGTTTTAG 7 20 

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780 

GTTTTTGTTT GTGTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 84 0 

TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 900 

ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATGTGC ATGAAAAGTC 9 60 

ATACTCGAAG ATGTTGATTA TTAACTAGAA TTAGTGGTGA TAAATTTGAA GCACTTTTGT 1020 

AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AG CAACG AAT 1080 

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATCGTAA 114 0 

TGATCAAGTG ATATATGTTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 12 00 

TCACGGGTG 12 09 
(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 6 0 

TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 12 0 

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 180 
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GCATCTTCAA CTTCAAAAGG CCCAATCGTA TAGCCTGAAC TAATAATAAT GTCATCTCGA 3 00 

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 36 0 

5 TATTtACCTG TTTGCGCTTT CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 4 20 

GCTGGCAAAT CAAGTGGTAC TG CAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 4 80 

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 54 0 

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600 

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 6 60 

CCTGCAGAAA CGG CACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720 

15 

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTG AAG C 780 

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 84 0 

20 CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900 

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960 

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 1020 

25 GACAGAATCG CCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 10 80 

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 114 0 

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 12 00 

30 GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 1260 

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320 

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 13 8 0 

35 

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 144 0 

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500 

^ GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 156 0 

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620 

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 16 80 

45 ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 174 0 

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 1800 

TACATCTGCA C CTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 186 0 

so TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920 

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATG CTT 198 0 
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ATTTAACACA CATAATGAAT CATTTGATAA ATCATTTATC GCTTCTAGCA CATCGTAATA 2100 

TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 216 0 

TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 2 220 

ATTACTkTAT ATCTATTACT AAtkCATTCC CGTATTTATT AATTACAATC ATAGTTTGGC 2280 

TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 234 0 

GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 2400 

AATAGCCGAC 2410 
(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

TTTATTAATT GTAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 6 0 

ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 120 

ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 180 

CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 24 0 

TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 300 

ACTACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 3 60 

TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT T AC CATTAAT 420 

ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 4 80 

TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 54 0 

AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 59 0 
(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 
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TATTCGATTT GAnTCTTTTA AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCATT 120 

TTTTGCACCA TCAATAAGCG TTTCAGcAAA CCC CTTAATT GAAGTAATAG GTGTTTTTAA 180 

5 TTCATGTGAA ACATTTGCTA CAAATTCACG TCTTAGATTT TCAAGTTGTT TCAGATTTGT 240 

TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 3 00 

AACGCATGAA ATATCAAAGT ACTTGG CATG GACTTGGTTT ATTGCAACTT CCAATTGTTC 3 60 

w 

AT AAATAGG T TTTTCAACTT TAAAACTTTC TAAAATTAAT TGCTCAATTT CAGTATTAAC 420 

ATAGCCGTGA TAG C CTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATACGC 4 80 

TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 54 0 

/5 

TTCAATCGTT GTTTTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA GCTTTTGAAG 600 

GCGTCGTGCT AAATCATTGG TAGA CACAAA AAGCGCTTTA GTTTCTACAA CATTACTTTC 660 

2o AGGTACACGT ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAA CTTCTT G 720 

AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 780 

AACTAAACAT GTCAAAATAA GATATTTCCA CAACGTCCAA TGCATTTCTG TAATATCGTT 84 0 

25 ATTGTAACCT TTAAT CCATA CATGATAACC GTTAACCTTC TTATtAAAAA TAAAAACGTC 9 00 

CCTTT 905 



(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 
TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60 

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120 

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 18 0 

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 24 0 

AAGATTTGaT AAAGCATATG C CGAATATAT TTTTGG cTAG ACTTATTTCA GAAGATAATG 300 

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 3 60 

ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 4 20 

TATGTATGAT TTTATATTTA AATAATTTTT CGGATGTAAA ATTATTAGAT GTTGGCGGGT 4 80 
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ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 600 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 660 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTG CTA 7 20 

TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 84 0 

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 9 00 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCCTGC 96 0 

tATATTTAwT GCACyTCmAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016 
(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 06 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60 

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120 

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180 

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TT AG CATGG C 24 0 

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300 

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360 

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 406 
(2) INFORMATION FOR SEQ ID NO: 468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATG CGAA TTGAAAACTT 60 
AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 120 
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AGGTACAGCG TTAAAACTAA GAGAAGCCAT CAATTATGAT GAAATGGTTA TTG TAG AT AG 24 0 

TATGACGTAG TTC CTAATT A TGCkAAAAGG GATTGATGAA AAACTGAAGG GCTTTTCATC 3 00 

5 AATCCCTTTT ATTTTAGGGG AATTGAATAG ATAGTTTTAA ACTATA CGAA TTATTAATAT 3 60 

TTGAGATTTA ATTGAAATAA GTTTTAAAAA TTGGAGGAGA TAGATTAAGC GAAGTCATTT 4 20 

AAAGGTGAAG TTAAGTGTAT TCACAAAAAn TAGCCACACT CATATGACAT CGGATGAGTG 4 80 

10 

TGGCTTAAGG AT CTATGGGG GGAGGAAnCC ATAGATGTTT ACTTTGATAG GCCAGATTAA 54 0 

ATATCAAAGT ATG CGATTAT TTATAGCTTG ATGCAAAAGT GGTATGCCTA TTAAAAGTTA 6 00 

CTG CACATAG CTTTTAATAT TCCGTTCAAA GGAAAGGGGC ATACAATTGA ACAATCTGTA 6 60 

15 

ATAGTACTTT TAACCAGCTA TGCTAAAAGT CTAGTAGGGA GAACAGTTGT CCAATCACAT 720 

AAGAACCTCT AACTTCGTTA GTACGATTAA GAAAAGCTTT TTAGTTAGTA TGTAATACAA 780 

2Q TTTATTGACG CGCGTGAATC T CTTTTATAA GAGTGTGTAG GGAATGGCGT TGTATAAATT 84 0 

GTATTAGAAG AACTTCTAAC GCATCTCTGT GGTTAAAAGA GATGAAGGGA ACGACAGTTT 9 00 

aATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTCTCGT TCAAAGAGAA GCAGcTGTTC 96 0 

25 GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGATAA 102 0 

ATGTTTACAG TTTAATTAAA ACTG CATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAgAG 10 8 0 

AAGCAGCTGT TCGCAgTTTA ATCAAAACCA CATAAAGCTT TTAACTTTAC TCTTTGATTT 114 0 

30 AAAGAGTGAC AAATGTTTAC AGTTTAATTA AAACTGCATA AGAACTTCTA GCTTTTCTCT 12 00 

TTCGTTCAAA GAGAAGTTCT AATACCACCA TATCGTGCGA TCGGGAACGG TATATATATT 12 6 0 

AATAGGAGGG TAATATATAT TTAACGCACG ATATGGGACT ATTAGCCTTC GACTTTGTTA 13 20 

35 TGTTGATGTG TGGCCTAAAA TATTGGAGAT AC CAATATTT TAGGTTGCAT CAACATCA 13 78 

(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 4171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 469: 

TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 60 

50 TCGTTTAATC ATACATCCCC C CTAAT CTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120 

AATAGAGGTT G G AT ACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180 

55 
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TTATATTCTA TTCAATTTAA TCTATGGATA CTGTGTCCCC ACACGACAGC AAAAGTTATC 30 0 

ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 360 

5 CTTAATACTG TCTAATTTTT TTGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 42 0 

ATGGGCCCTT CTTCTGTTCT ATCCATTGTT GAAGAATGTC CACGTTGCGT CTTAAAATAA 4B0 

TATCAATATC ATGCGGATAA TTCATTTGAT TCATATG TTG CTCATATTCA TCTTCATCTA 540 

10 

ATAAATGATA CTTTCCGTTT GGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 600 

ATGCCTCTTC ATCACAAACA AATGGTGATG ACAAATTGCA ATAGTAATAA ATTCCATCTT 660 

CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 72 0 

15 

CACGTGTTAT CCAAGTTCTT CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 780 

TGACAACATG ATCAGTACCC TTTAATATTG TTGTTTCAGA CCAAACGCGA TGAATCTTAC 84 0 

20 CATCATGTTT ATAACTCTGA ATTTTAATGT TTTCCCCTTC TTTAGGTATG GATT CTCTGA 90 0 

CCATACTCCA CACCACCTTC TGTTAATTTA AC CATTATAA ATTATAGCAT ATTTCAGAAA 96 0 

TAGTATTATA TAAATACATA TTTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 102 0 

25 GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 108 0 

ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GATTTAAAGA 114 0 

TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 12 0 0 

30 GAATGTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 126 0 

TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 132 0 

CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 1380 

35 AAAATAGGTG TTTCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 144 0 

GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACGC 150 0 

TTTTGCAACA GATATTGACC TTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 156 0 

40 

TTTTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACCTTT ATCAAATGCC 1620 

TCACAGTTTT CTTGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 16 8 0 

45 GCCCCTAATT CCATCATAGC TTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 174 0 

AATTCTTGTT CGTACGATTT CCTCGTCGAT TGTAATTTAA TATCTCGATA GTCATCATTC 1800 

AATCTAGAC C ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 186 0 

so ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 192 0 

GGATCTTTGG GAACTAAGCC TTCATATTTA TCATGAACTT CTTTAATCGC CGTATGAAAA 198 0 
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 2100 

ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 2160 

5 GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 222 0 

TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCTTTCCT CACCAAAATT AATTGTCTTT 2280 

ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 23 4 0 

W 

TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 24 00 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATC CAGCA 2 460 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 252 0 

15 

GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 264 0 

20 TTCTTTAGTG GAACCAACCC ATTT CATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 2 700 

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 2 76 0 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACCCTA TTATTTTCAC AGTTCTTTGT 2820 

25 ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCATTTG CAGTCTT CTT TCCTATAATC 2880 

GCTTTACTAA T CAT TTATTA CATGATTCGT TTTAAAATGA GAGCCGTAAT TAAGCAACAA 2 94 0 

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC GCCAACAATA 3 00 0 

30 AAATTTATGG AATGG CGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 3 060 

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 3120 

ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 3180 

35 TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTT GATTGATTTG 3240 

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 3 300 

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 3360 

40 

TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 3420 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 3480 

4S TGCTTTTACA AAATCTAAAA TTTATATTGT TGCTAACAAA CTACCTTTAA TGACTCGAAA 3 54 0 

TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATT CTCA 3 600 

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 3 660 

SO CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3 720 

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 3 78 0 
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TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 3 900 

ACATGATTTG GATTTGGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 3960 

5 CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 4 020 

ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT 4080 

TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 414 0 

10 

CTTAAAATAG ACATAATAAC TACTGGCGTC G 4171 

(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 9821 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 60 

25 AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT 120 

CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT T AGAAGC GGA 180 

TACAACTTAT CCAAAAGGTA CTTAC CAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 24 0 

30 TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 300 

ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCCAA ACATTGCACC 360 

AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 420 

35 AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT AC TTCAGTAA TCACTGCTAC 480 

GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 540 

ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 600 

40 

TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 6 60 

AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 72 0 

4S AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 78 0 

TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 84 0 

ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 900 

SO ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCT C AATAAAAGAC ATTTTATCGA 96 0 

ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 102 0 

ss 
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AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGGAA 114 0 

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA G CTG AAATTA 120 0 

5 GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 126 0 

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 13 2 0 

CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 13 80 

GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 144 0 

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 1500 

^ GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 156 0 

TATAAACGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 1620 

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT IS 80 

20 GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 174 0 

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 1800 

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 1860 

25 ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGC AAAAGCTGAA ACAGCTGTTA 1920 

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 1980 

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGG TAAA CTTGCAAGTC 204 0 

30 AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100 

GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160 

CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2220 

35 

AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 22 80 

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2 34 0 

CAATGGAGGC TATGCCAGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGCA GAAATGGCTA 24 00 

40 

CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 2460 

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2 520 

4S TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 2 580 

ACACTTCTGC AGCAATTGAA GTTTTAT CTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2 640 

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2700 

50 TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760 

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAG CAACAG 2 82 0 
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CAGATAAAAT TAATAGCTAT AGCAAAT CAT TGAAGAACTC TAATGGTGAA AGTAAAAAAG 2 940 

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGGA ACAATTAGGT GGCGCTTTTG 30 00 

5 AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3 0 60 

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 312 0 

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 3180 

10 

TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACGC ATTGCTGAAA 324 0 

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3 300 

TTCTTGGTTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 33 SO 

T5 

TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCGT 34 20 

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 34 80 

2 0 GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 354 0 

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA TGATCGTGTG GAATGGTTCA 36 00 

GAAACGGTAT TAACGGTTTA GGAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 36 60 

25 GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTCA 3 72 0 

AAGAAAAGTT TTCAAAGGAT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3 7 80 

TGAAAG TAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 3 84 0 

30 AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 3 900 

TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 3 96 0 

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA G CGGATTTAT 4020 

35 

CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4 080 

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 414 0 

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 42 00 

40 

AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4260 

AAAAGCTTGA AAATCAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 432 0 

4S AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4 3 80 

CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 44 4 0 

AG CAGTATG A AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4 5 00 

50 AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4 56 0 

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 46 2 0 
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GTTGGTGGTC TAACTTTAGA GAAGACCAAA AGAAGAAAAG TGATAAATAC GCTAAAGAAC 4 74 0 

AAGAAGAAAC AGCTCGTAGA AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 4 80 0 

5 ACGG CGTAAA AACTAAAACT GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT GCTAATCATT 486 0 

TTGGCGGCGA AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC AAATTAAGTT 4 920 

CAAGTTGGAG CTCAGCCAAA AGTTCTGTAG GATATCACAC TAAGGCTATA GCTAATAGTA 4 9B0 

CTGGTAAATG GTTTGGAAAA GCTTGGCAAT CTGTTAAATC GACTACAGGA AGTATTTACA 504 0 

ATCAAACTAA GCAAAAGTAT TCAGATGCCT CAGATAAAGC TTGGGCGCAT TCAAAATCTA 5100 

TTTGGAGAGG CACATCAAAA TGGTTTAGCA ACGCATATAA AAGTGCAAAG GGTTGGCTAA 516 0 

1S 

TAGATATGGC TAATAAATCG CGCTCGAAAT GGGATAATAT TTCTAGTACA GCATGGTCGA 522 0 

ATGCAAAATC CGTTTGGAAA GGAACATCGA AATGGTTTAG TAACTCATAC AAATCTTTAA 5280 

20 AAGGTTGGAC TGGGGATATG TATTCAAGAG CCCACGATCG TTTTGATGCA ATTTCAAGTT 534 0 

CGGCATGGTC TAACGCTAAA TCAGTATTTA ATGGTTTTAG AAAATGGCTA TCAAAAACAT 540 0 

ATGATTGGAT TAGAGATATT GGTAAAGACA TGGGAAGAGC TGCGGCTGAT TTAGGTAAAA 54 6 0 

25 ATGTTGCTAA TAAAGCTATT GGCGGTTTGA ATAGCATGAT TGGCGGTATT AATAAAATAT 5520 

CTAAAGCCAT TACTGATAAA AATCTCATCA AGCCAATACC TACATTGTCT ACTGGTACTT 5580 

TAGCAGGAAA GGGT GTAGCT ACCGATAATT CAGGAGCATT AACGCAACCG ACATTTGCTG 5 64 0 

30 TATTAAATGA TAGAGGTTCT GGAAACGCCC CAG GTGGTGG AGTTCAAGAA ATAATTCACA 570 0 

GGGCTGACGG AACATTCCAT GCACCCCAAG GACGAGATGT GGTTGTTCCA CTAGGAGTTG 5760 

GAGATAGTGT AATAAATGCC AATGACACTC TGAAGTTACA GCGGATGGGT GTTTTGCCAA 58 20 

35 AATT CCATGG TGGTACGAAA AAGAAAAAAT GGATGGAACA AGTTACTGAA AATCTTGGTA 5880 

AAAAAGCAGG GGACTTCGGT TCTAAAGCTA AAAACACAGC TCATAATATC AAAAAAGGTG 5 940 

CAGAAGAAAT GGTTGAAGCG GCAGGCGATA AAATCAAAGA TGGTGCATCT TGGTTAGGCG 6 000 

ATAAAATCGG CGATGTGTGG GATTATGTAC AACATCCAGG GAAA CTAGTA AATAAAGTAA 6060 

TGTCAGGTTT AAATATTAAT TTTGGAGGCG GACTAACGCT ACAGTAAAAA TTGCTAAAGG 6120 

4s CGCGTACTCA TTGCTCAAAA AGAAATTAGT AGACAAAGTA AAATCGTGGT TTGAAGATTT 618 0 

TGGTGGCGGA GGCGATGGAA GCTATCTATT TGACCATCCA ATTTGGCAAA GGTTTGGGAG 6240 

TTACACAGGT GGACTTAACT TTAATGGCGG TCGTCACTAT GGTATCGACT TTGGTATGCC 63 00 

50 TACAGGAACG AACATTTATG CTGTTAAAGG CGGTATAGCT GATAAAGTAT GGACTGATTA 63 60 

CGGTGG CGGT AATTCTATAC AAATTAAGAC CGGTGCTAAC GAATGGAACT GGTATATGCA 6420 
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ATCAGGTGCT ACAGGTAATT TCGTTAGAGG AGCACACTTA CATTTCCAAT TGATGCAAGG 6 54 0 

GTC G CAT CCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6 600 

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 666 0 

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATAT CAT 672 0 

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 678 0 

AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 684 0 

ATTTAGACAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6900 

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960 

AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 702 0 

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAG CAATTCA 7080 

ATTAACTGAA CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 714 0 

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 720 0 

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 726 0 

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 732 0 

AAATGCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 7380 

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 744 0 

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 75 00 

_ CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGAT GGTAT A 7560 

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 762 0 

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACA GCTAAT 7680 

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 774 0 

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAGT TCTTATTCGA CTCTAATTGG 7800 

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACAC CTAAAT ATACTCATAC ATCAAATCAA 786 0 

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7920 

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 79 80 

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 804 0 

GCATATCGAG ATATAAA CAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 810 0 

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 816 0 

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 822 0 
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ATTATGAACT 


GAATGAAGCT 


AGGTACATCA 


CCTTTACAGT 


TTATAGAACT 


ACTCATAATA 


8340 




GTTTTGTTTT 


TGATTTATTG 


ATTTGTGAAA 


ACTTCATAAT 


TTATCATGGT 


GAAAAATACA 


8400 


5 


CAATTAAGCA 


GACAGCGCCA 


AAGGTTGAAG 


GTGATAAAGT 


TTTTATTGAA 


GTTACGGCAT 


8460 




ATCACATAAT 


GTATGAATTT 


CAAAATCACT 


CAGTGGAATC 


AAATAAGCTT 


GATGACGACA 


8520 


10 


GTAGCGAAAC 


TGGTAAAACG 


CCAGAATACT 


CTTTAGATGA 


GTACTTAAGA 


TATGGATTTG 


8580 


CAAATCAAAA 


AACGTCAGTC 


AAGATGACCT 


ATAAAATAAT 


TGGAGATTTT 


AAAAGAAAAA 


8640 




TACCAATTGA 


TGAATTAGGT 


AATAAAAATG 


GCTTAGAATA 


TTGTAAAGAA 


GCAGTAGATT 


8700 


15 


TGTTTGGTTG 


TATTATTTAT 


CCAAATGATA 


CGGAGATATG 


TTTTTATTCT 


CCTGAAACAT 


8760 




TCTATCAAAG 


AAGCGAAAAA 


GTAATAAGGT 


ATCAATATAA 


TACTGATACT 


GTGTCTGCTA 


8820 




CTGTCAGTAC 


GTTGGAATTA 


AGAACAGCTA 


TAAAAGTTTT 


TGGGAAAAAG 


TACACAGCCG 


8880 


20 


AGGAAAAGAA 


AAATTATAAT 


CCTATTAGAA 


CAACTGACAT 


TAAATATTCA 


AATGGTTTTA 


894 0 




TAAAAGAAGG 


TACTTATCGT 


ACCGCAACAA 


TTGGGTCTAA AG CT ACTATT 


AACTTTGATT 


9000 




GCAAGTATGG 


TAATGAAACA 


GTTAGATTTA 


CAATAAAaAA 


GGGCTCTCaA 


GGTGGAATAT 


9060 


25 


ATAAGTTGAT 


TTTAGACGGC 


AAGCaAATTA 


AGCaAATTTC 


TTGTTTTGCT 


AAGTCGGTTC 


9120 




AGTCTGAmAC 


AATAGATTTA 


ATAaAAAATA 


TTGATAAAGG 


CAAGCACGTT 


TTAGAAATGA 


9180 




TATTTTTrGG 


AGArGrCCCC 


AAAAATAGAA 


TTGATATATC 


TTCAAATAAA 


AAAGCTAAGC 


9240 


30 


CTTGTATGTA 


TGTTGGAACT 


GAAAAATCAA 


CAGTCTTAAA 


TTTAATTGCT 


GACAACTCAG 






GTCGCAATCA 


ATACAAAGCA 


ATTGTTGa CT 


ACGTCGCAGA 


TAGTGCAAAG 


CAGTTTGGGA 


9360 


35 


TTCGATATGC 


TAATACGCAA 


ACAAATGAAG 


ATATCGAAAC 


ACAGGATAAG 


CTGTTAGAAT 


9420 




TTGCAAAAAA 


GCAAATAAAT 


GATACTCCTA 


AGACTGAATT 


AGATGTTAAT 


TATATAGGTT 


9480 




ATGAAAAAAT 


AGAGCCAAGA 


GATAGCGTAT 


TCTTTGTTCA 


TGAATTAATG 


GGATATAACA 


954 0 




CTGAATTAAA GGTTGTTAAA 


CTTGATAGGT 


CACATCCATT 


TGTAAACGCA ATAGATGAAG 


9600 




TGTCTTTCAG 


CAATGAAATA 


AAGGATATGG 


TACAAATTCA ACAAGCGCTT 


AACAGACGAG 






TTATTGCACA 


AGATAATAGA 


TATAACTATC 


AAGCAAATCG 


TATAAATCAT 


TTATACACTA 


9720 


45 


GTACTTTGAA 


TTCTCCTTTC 


GAGACAATGG 


ATATAGGGAG 


TGTATTAATA 


TAATGGCAAC 


9780 




AGAAGAAGTT 


AAAATCAAAG 


CGCTACTTGA 


AAACGATAAA 


C 




9821 




(2) INFORMATION FOR SEQ ID NO: 471: 









SO 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 471: 

^ TTATTGTTTT CCAAATGGTC TTCTCGGTTA TTTAACCATC GGTTTCAATC CTACGTG CTT 6 0 

CAC TTTCTAT CtATTAATTC AaTTTCAGCT TGACCACCCG CTGTATAAAG GGTCAAAGTT 120 

GCTAATCGAT AGCGTCTCAT TATAGGACCA ACATCGATAT CAATATTTTG AATACGAAAA 180 

w TATGGTATTA CCTTTTCATC CAAAAATAAA ATGCCGTTTC GTACACGCAA ATGGTGTTTT 240 

TCAAATG CAT ATCTGCAGTG CTTATATCGA TAGACTGGCG CTATAACAAG CGTGAAAACA 300 

GCAACAAGTA ATATTATAAT CACACTACTA ACAATGGATA AATGGTTATC TAAAATCTCC 3 60 

15 CAAAATAGCC AGTTCAAAAT ATTAAATGCG ATTAAAAGTA CAAGCGCTAT GGGTATCCAA 420 

AACAGCACAG CACTTAACCT CATCACTTTT TTAGCGTGTG GTGACATAAA ATTATAATCC 4 80 

CTCATCATTT TCACCTCTTA AATACCATGA TTTCAATTTA TTTGCATCTT CACTTCTTGC 54 0 

SO 

GTATTTTAAG TTAATCGATT GGGCTCCCAC AC CTTTAGCA ACAATAAAGC TAAAATTATT 6 00 

TAAATTGTTG CGTTTAAGTA ATGTATGTTG CCAAGTGTCA AATCCTATAA TGTGATG CGC 6 60 

TTTAAAATAA TAAATATTTC GTTTCAATAG CTCGAAATTC TGGATAGTAA TTTGTTCTTC 720 

25 

TGTCATTTTA AAACCCGCAT GTTTGACATA AAGATATCCT TTGATCACAA AT AAAC CAAT 780 

AATGACTATT GTTATAATCG TAAATAACAA TAATAATTGA TTCCAAAAAT AACAGCCTAT 840 

3Q ACCTGCCATA GCTATGACAA TAATACTAGG TATTAAAAAG TGT CTGTGG A AACCTGACAA 9 00 

AGG CATACCT TCATTAACTT GTTGATAAGA TAAATCTGGT ACTAAATTCT GGATAATTTG 96 0 

ATATGCTTTG TCTCGTTTAA TAAACGGCaA TATCGGCACA CTACCTGAAT CATTGTC 1017 

35 (2) INFORMATION FOR SEQ ID NO: 4 72: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 6 806 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

45 

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 60 

GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 120 

so TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 180 

TATTCTTTTC TTCTTAATCG G C GAATT AAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 240 

55 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 360 

AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 4 20 

5 TAACTGATCA ATCACCTGTT TAG CAATATG ATATGCATCA CTATATAACT GCATAGTTTC 4 80 

CATAACAAAT GGATTGCTAA TAACTTGATT TTG T TTTAAG CGTTTATAAG CAAATATAAT 54 0 

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 600 

'° AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 660 

TTTATAAT GT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 72 0 

TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 780 

15 

AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 84 0 

ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 900 

oo AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 96 0 

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 102 0 

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 108 0 

25 CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 114 0 

TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 120 0 

TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TGATACAGAT GATAATTGAG 126 0 

30 TACAAATTTA AAACC CTGAG ATTTTCGCTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 132 0 

TTTATATAAT GAATCGTTAC ATTAAAAATA TTT ATTTAT C AG AGTT CTTA TATTTGTTAG 1380 

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 144 0 

35 

TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA 150 0 

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 156 0 

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 162 0 

TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 168 0 

AGTAATTAC C TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 174 0 

45 CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTG CAACA C CTTGAGGTG 180 0 

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAAT CAT GATTGCAGTT GCTGTACAAA 186 0 

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGG TGTG 192 0 

50 GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 198 0 

AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 2 04 0 
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GCAATATTAC TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160 

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2 220 

TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 2 280 

T AGATTG CAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2 34 0 

TACCTAGCGA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGG AC CA CCACGGAATC 24 00 

CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 2460 

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2 52 0 

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAA CCAGCA CTAATCGCAA 2580 

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 2 64 0 

AAGCTCTAAA CATTTCAGGT AACATT CGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2 700 

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2 760 

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2 82 0 

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2 880 

GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 2 94 0 

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3 00 0 

ATTTAGT CAA AATTTAACAT TA CTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3 060 

TTTAG CTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120 

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180 

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 3 24 0 

GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGCAACTT 3 300 

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360 

CATTTTCAGA AACACCTTCT GTCATAACAA CG AAATCTT C AGCTTTAAGA TTTATTG ATT 34 20 

TAACACCAGC TGCC CTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GTTAATGACA 34 80 

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 3 54 0 

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACGC GTTGTTTTAA 3600 

ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTG AG T CGCAAAAACA T AAAATG CAT 3660 

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 372 0 

CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780 

GTATAAATAG ATAACGACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 3 84 0 
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TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 3 960 

CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 4 02 0 

5 AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 4 08 0 

TCAATAATGC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 414 0 

CTTTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 4200 

'° TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 4 260 

CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTAT CTAAA ATTGACAACG 4 32 0 

CTTTAATCAA AC CTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACTTCG 4 3 80 

IS 

TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 444 0 

CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 4 500 

2o AATCAGAGTT TTTAT AAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 4 560 

CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 4 620 

CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 4 680 

2s ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT 4 74 0 

TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC TTTTTTAATA CCATCAATAC 4 80 0 

CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 4 86 0 

30 CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 4 920 

GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 4 980 

TAGGAAATCT TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 504 0 

35 AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AG CTAGTAAG CTTAACTTAG 5100 

CTT CAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCGAT ACTACCATTA TTACCATGCA 5160 

TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACGGACCATT GCTTCGTACA 5 220 

40 

CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCGCACTTT 52 8 0 

TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 5 34 0 

^ GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATATTTAC 54 00 

TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 54 60 

TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 552 0 

SO TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 5 5 80 

AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 5 64 0 
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACGTG AaGCCTTTAC CAAGTTCTTT 576 0 

TTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCTTT 5 82 0 

£ ACCTTTTTCC AATTTATAAA GTGGAGGTAA AG CAATAAAT ACACGACCTG CTTGAACAAG 5 880 

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5 94 0 

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000 

'° ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 606 0 

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 612 0 

TTTGCGGTCT CGTc CAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 618 0 

15 

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 624 0 

TTTACG CTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300 

so TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360 

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420 

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 64 8 0 

25 CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT CTTTTGTTTT 6 54 0 

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6600 

AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTT C 666 0 

30 TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6720 

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 678 0 

GATTCCTTCT TATAATGGTA TGTCTT 6806 

55 (2) INFORMATION FOR SEQ ID NO : 4 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1716 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

4s (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTnn 60 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

SO TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 240 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTTGGAT A CGAACAT CA 360 

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTGATAT CTTCAACTGA GATATTAAAT 4 20 

5 TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTTTTTCA 4 80 

GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 54 0 

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600 

w 

AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 6 60 

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTG TAAT CGCTTCTTCT 720 

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 780 

15 

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAG CAATTT C ATCTGTTAAT 840 

TCTGGGACTT CTTTAAATTT AATTTC GTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 900 

2o GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 960 

ACTTT CATAC CTTCTAATTG CTCTTCGAAA CCAGGTATGA ATGAACCTGA AC CGATTTCT 1020 

AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA AC CACTAAAG 10 80 

2s TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 1140 

AAATGT CCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200 

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260 

30 GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 13 20 

G CAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 13 80 

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGG C 144 0 

35 ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 150 0 

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 156 0 

TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATAC CTCC ATGATTAACT AATTTATTTT 1620 

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 16 80 

TATATATTTG TAAAT CAACT TTTTTCGTCA AAACTA 1716 

4s (2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
50 (D) TOPOLOGY: linear 
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TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 6 0 

ACCTGTACGT GAGTGGG CAT GGAGTCGTGA AGAAGAAATC GATTATGCAA TTAAACATAA 12 0 

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180 

AGCGAATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCGTTCGA 24 0 

TCTAACAAAT GCTTTAGAAG AAA CAC CAG A TACTGCTGAT GrAATCATTT TAACGTTTGA 3 00 

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 36 0 

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGGAAGAATT GACCATGTAG AAAATAGACT 42 0 

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 480 

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC GCACACTTTA AACCAATCAT 54 0 

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 6 00 

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 660 

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 720 

ATTAGCAACT TATACAAAAG AAGATGCATT T nATCAAG AC GCTGCTGTTG G CTTTATCGA 7 80 

TATCTATGGT TTACC 795 
(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60 

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATG ATG r GaAATATTAT 120 

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180 

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 24 0 

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 3 00 

CAAG GTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360 

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 4 20 

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 4 80 

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 54 0 
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GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTCA GGctACGCAA ATAAATTTAC 6 60 

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA AT CATAGATA TTTTGGATAA 7 20 

5 TGATGCTACC GAAGTGTACT TTGAATTAC G TACAT CTAAG ACTTCTTCAA CATTAATTGC 7 80 

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 840 

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 8 87 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1183 base pairs 
<B) TYPE: nucleic acid 
,s (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 

20 

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTC CGATGG ACCTTTTAAA 60 

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 12 0 

2S AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 180 

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 240 

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 300 

30 AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAA CAAGG TCTAGAAGTA AAAATATTAT 360 

ATGATGATGT T GGATCT AAA AATGTTAAGA TGG CAAATTT TGATCATTTT AAATCGTTAG 420 

GTGGAGAAGT TGAAGCATTT TTTGCTT CAA AATTACCGTT ATTGAATTTC AGAATGAATA 4 80 

35 ATAGAAATCA TAGAAAAATC ATCGTAATCG ATGGTCAACT AGGTTATGTC GGAGGATTTA 54 0 

ACATTGG TGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 60 0 

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 660 

40 

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGAC CAT 72 0 

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGA CTGG CAT CAAATTG 780 

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 840 

45 

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900 

ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 960 

50 CAAATGC CTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 1020 

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 10 80 

55 
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC H 83 
(2) INFORMATION FOR SEQ ID NO: 477: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 77: 

GGTTATATGT TTGGTATTAC TCATTATATT GATTC-GAGGT TGTGTCATTA TGACAAAAAC 60 

15 

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 120 

CAAGAATCTC GATGATTTTT ACGATAAGGA AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180 

^ AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 24 0 

GGAATCGAGA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 3 00 

TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 360 

2S TCCTGTCAAG ATGGAAAATA ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 42 0 

AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 4 80 

AGATTATAAA AATGG TGAT A TTTCATATAA TCCTAATGTG C CAAGTTATT CTGCAAAGTA 54 0 

30 TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 60 0 

CAAACAAGCG CCCGAATTAA AATTGAAAGG AT C C GGCAAT TTAAAAGGCT CATCCGTAGG 660 

ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAGAA GAGAATATCT ATTTTTCAGA 720 

35 TTCGGTCGAA TTTACACCTA GCGAGGATGA TAAATCATGA GTCAAAC GGA ATATCAAATA 78 0 

AAATCTGGCA ATATAAAAGG TAACTCTGAA G AAA CAAGT A CAGTATCTAA TATAAGTTAT 84 0 

GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 90 0 

AAGTTA CAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 96 0 

CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 102 0 

^ GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 10 80 

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CGATGTCAAT 114 0 

ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 12 00 

so AAAAATATCA AAAAAGACTA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 1260 

GATG CGATGA TTTTAGG TAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 132 0 

55 
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ATT GAAAAGT ACGATGGTCA CATTGTAAGA TTTGTGTCTG ATGAAGACGA ATTAGATGCA 1440 

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAATGGAGAA 150 0 

GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AGCTGAATTA 1560 

AACAAAG TTA AAGG CTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 1620 

ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 1680 

GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 1740 

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 18 00 

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGGTG 18 60 

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCG TATGAAAGTa 1920 

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 1980 

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TT AAA CAGAG TATCGAAGCA ATTGTTTCTA 204 0 

ATGACCAAAT GTTAGCGAGC CAGATTGATG GGATGATGTA ATGTTTACTA CGTATAAnAA 2100 

TATTAATGAA CTTGAAAATG C CTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA 2160 

TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAmATGT GAA CAAATGT ATGATCATTT 22 20 

CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA TGACACGTAT 22 8 0 

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 23 3 2 
(2) INFORMATION FOR SEQ ID NO: 478: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 60 

TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 12 0 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 18 0 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 24 0 

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 300 

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 36 0 

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTAC CTTC AACGCCTTCT TCATCATATG 42 0 
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GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTT CAG 54 0 

CATGTTTACG TAAGTCGAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 6 00 

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 66 0 

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTG 72 0 

CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 780 

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 84 0 

GGaATGaTTT ACCAATTTGG ACCGG 86 5 
(2) INFORMATION FOR SEQ ID NO: 47 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 

TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GTTAACTCTA 60 

CTAAATAATT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 12 0 

TAGAATACG T ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180 

CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 24 0 

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 300 

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TTATAAATAT 360 

TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 420 

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 48 0 

TCATATTTCC CTCCCAATTG G CT ATT AT AC GGTGTCAATT CTGTGATGTG TGTGAACAAA 540 

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 6 00 

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TGATTATTCG TATGTTTTAT CAAGCAGCTA 66 0 

TGATAAAATT TAAACATAAT ACAATG C GAG CCATTTAACG ATCTATGTTT AAATGGACAT 72 0 

CGATATTGTA TGAATTCGTT GTAACAAGCA AG CATTT CT A TGTGAACGAA CCAAAGGGGA 78 0 

AAGTAACATG ATTAATAAAG AACAATTAGA TCTTTTATAT AAATTAAAAA AAGAAGTTGA 84 0 

AAAGTCGCGA AATGAAGCAC TTTTACATAC AATTAACCAA GTAATTAAGA AAGTATATTT 9 00 

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 96 0 
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TATTGTGTCA GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGTATGC 10 8 0 

CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 114 0 

TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 120 0 

AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG 12 6 0 

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 132 0 

TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTGACAG 13 8 0 

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 144 0 

TGGC 1444 
(2) INFORMATION FOR SEQ ID NO: 480: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 

GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAk CATAC 60 

CCAGGTGACG TATTCTACTT ACATAGTAGA t T ATTAG AAA GAGCAGCAAA ATTAAACGAT 120 

GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180 

TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 24 0 

GATTTATTCT TCTCAGGTGT AAGAC CAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 300 

GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTTGCTG GTACGTTACG TCTTGACTTA 360 

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420 

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAAGTCT TAAAACAAGA TCAAAACAAA 480 

CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 54 0 

GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACG AG T TAAACCA CTG GGCAGAATCA 600 

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 660 

TTTGACACAG CAATTAACGA ATTCAAAAAA AG CTTTAGCA AATCTGAATA ATAAACAAGT 720 

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 7 80 

ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 84 0 

AG AG CTG AAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATGCAATT 90 0 
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ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 10 80 

£ AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 114 0 

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AAT CTGTTCA AGCACTAGCT 1200 

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260 

10 CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320 

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 13 80 

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACGG AACAATATTA 144 0 

15 

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500 

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAG CGAGACA AGCAGAAATT 1560 

2o ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 16 20 

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 16 80 

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 174 0 

25 GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800 

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 18 60 

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920 

30 GAAACAATTG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980 

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 204 0 

GTAGTAGATT TACT AG CAC C TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

35 GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160 

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGG TAACGA TTTATACTTC 2220 

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 2280 

40 

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGG CTGA ATATTTCCGT 2340 

GACGAACAAG GTCAAGACGT ATT ATT ATT C ATCGATAACA TTTTCAGATT TACACAAGCT 24 00 

^ GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 24 6 0 

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2 52 0 

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2580 

SO GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 2 64 0 

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 270 0 

55 
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CAAGATATCA TTGCTATCTT AGGTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2 820 

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 2880 

5 ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAGATATC 294 0 

TTAGATGGTA AATATGACCA TATTCCAGAA GATGCATTCC GTTTAGTTGG TAGCATGGAT 3 000 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

to 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATG C AAACAACAGC TGGTGAGATA GGTGTCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 3240 

(5 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AG TTAGAAAA GATAAAGTTT CAATCATTGT 3300 

TCAGACTGCA GAAACTGCAA GAGAAATTGA TGTTGAAAGA G CT AAATTAG CCAAAGCAAG 3360 

20 AGCAGAGTCT CACTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 3420 

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 34 80 

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 354 0 

2S ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT TTATGATGTA GTGGTTCTTT 3600 

ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT 3660 

AAGAAATACT AATTTCTAAA GAAAAAGTAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3720 

30 CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC CGACTCCCAA GAATAGTTTT 3780 

ACTTTTTTAC AATCACTAAT AG ATTG CTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3 840 

TCGACATTTC ATTTTTTGAA ATTATCTACA TTTTTTCATA CCAAGATATT TTATAGTTAT 3900 

35 

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3960 

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4 020 

TGATAAAATG GATTATATCG GACAATATGC AGTTATCCAT TTAG TGTTAC ATGTTGTATG 4 080 

TATTTGTATT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140 

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4 200 

45 AAGCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4260 

AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT 4320 

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 43 80 

SO ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 4440 

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAG CTTCTG ATAAACCGAG CAAATTAGTT 4 500 
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GACGTTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4 6 20 

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 4680 

5 CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4 74 0 

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAG CTTTAG GCGCAGAAAT T CATCTTG AA 4 800 

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4 86 0 

10 

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAG C TAAGGGTAAG 4 920 

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCG ATTTAGCAAA CTACATTAAT 4 98 0 

is GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 504 0 

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG CACATTACTA 5100 

ATCGCTGGTG CTATAACGCG TGGTGATATT TTTGTACGTG GTGCAATCAA AGAACATATG 5160 

20 GCGAGTTTAG T CTATAAACT AGAAG AAATG GGCGTTGAAT TGGACTATCA AGAAGATGGT 522 0 

ATTCGTGTAC GTG CTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 52 8 0 

GGATTC CCGA CTGATATGCA ATCACAAATG ATGG CATTGT TATTAACGGC AAATGG T CAT 534 0 

25 AAAGTCGTAA CCGAAACTGT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 54 00 

ATGAATGCTA ATATCAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAG TCAATTG 54 60 

CAAGGTGCAC AAG TTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 552 0 

30 TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGG CTATGTT 558 0 

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 564 0 

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 57 00 

35 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 58 20 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAG CGTT AcTCAAACAG 58 8 0 

40 

GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 594 0 

TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 6000 

45 AAAT CACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTCGATGGTC 606 0 

AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 612 0 

ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 

so TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 624 0 

TTCACCAGAT AATCCATCAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 63 0 0 

55 
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(2) INFORMATION FOR SEQ ID NO: 4 81: 

(i) SEQUENCE CHARACTERISTICS: 

IA) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA SO 

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 12 0 

15 

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180 

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 24 0 

20 TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 300 

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATA CTGCA AGTTTCAAAT AATAAATGAT 360 

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 4 20 

2S TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 4 80 

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTG A CTTGA ACGTGATGAT CTATGATCTT 54 0 

TCAAGCACTT TTTGCCGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600 

30 AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 66 0 

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 720 

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAGCTGGT GTTGAGTATA 780 

35 

AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGG CTTAG 840 

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900 

^ CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 960 

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 1020 

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 10 80 

45 CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 1140 

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAAT CATG A ACAAATCATT GAAGCGATTA 1200 

AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAATTTGGTG 1260 

50 TAcTGcAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 13 20 

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 13 80 

55 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

5 AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 16 80 

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717 

10 

(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1279 base pairs 

(B) TYPE: nucleic acid 

/s (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 





GTAAATCTGT 


TACTCGAAAT 


GTAACGATTA 


AAGAAAAGGG 


CTCATCTCAA ACATATATTT 






TGTTAGGCTA 


TCCAACAAAA 


GCACAGAAGA 


ATAGTCATAG 


CAAATATAGT GGAGTCTTTA 






TATATAAAGA 


CTTGAAATCA 


ATCGAAGATA 


CAAATAATGC 


TATTACGATT ATCACCATAA 






TTACGGCTGT 


TATTTTCTTA 


ACAATTACAA 


CAGTCTTTGC 


GTTTTTCTTA TCGTCAAGAA 


240 




TTACAAAACC 


TTTAAGACGT 


TTAAGAGACC 


AAGCTACACG 


TGTATCTGAA GGGGATTACT 


300 


30 


CTTATAAACC 


TTCTGTCACA 


ACGAAAGATG 


AAATTGGTCA 


ATTATCGCAG GCATTTAATC 


360 




AGATGAGTAC 


AGAAAT CGAA 


GAGCATGT CG 


ACGCATTATC 


CACAT CT AAA AATATTAGAG 


420 




ACAGCTTAAT 


TAACTCTATG 


GTAGAAGGTG 


TCCTAGGTAT 


TAATGAGAGT CGACaAATTA 


480 


35 


TCTTATCTAA 


TAAGATGGCG 


AATGATATTA 


TGGACAATAT 


TGATGAAGAT GCTAAAGCTT 


540 




TCTTATTAAG 


ACAAATAGAA 


GATACTTTTA 


AATCAAAACA 


AACTGAAATG CGCGATTTAG 


600 


40 


AAaTGAATGC 


ACGATTCTTT 


GTTGTGACCA 


CAAGCTATAT 


CGACAAGATT GAACAGGGAG 


660 




GTAaAAGTGG 


TGTTGTTGTG 


ACAGTTCGTG 


ATATGACTAA 


TGAGCACAAT CTAGATCAAA 


720 




TGAAGAAAGA 


TTTCATTGCT 


AATGTATCAC 


ATGAATTACG 


TACACCGATA TCATTACTTC 


780 


45 


AAGGTTATAC 


TGAATCAATT 


GTAGATGGTA 


TTGTTACAGA 


ACCGGATGAA ATAAAAGAAT 


840 




CGCTTGCCAT 


TGTCCTTGAT 


GAATCGAAAC 


GTTTAAATCG 


TTTAGTTAAT GAATTGTTAA 


900 




A1GTCGCACG 


CATGGATGCT 


GAAGGGTTAT 


CCGTAAATAA 


AGAAGTTCAG CCTATTGCAG 


960 


50 


CGTTACTAGA 


TAAGATGAAA 


ATTAAGTATC 


GCCAACAAGC 


TGATGATTTA GGTCTAAATA 


1020 




TGACTTTTAA 


TTAyTGTAAG 


AAGCGTGTTT 


GGAGTTATGA 


TAwGGATCGC ATGGA CCAAG 


1080 



55 
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TTACTTGTGA TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 12 00 

TTGCACCAGA ACATTTACmA CAAG TATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 1260 

5 ACCCCGnGGT AACCAnGTA 12 79 

(2) INFORMATION FOR SEQ ID NO: 483: 

(i) SEQUENCE CHARACTERISTICS: 
io (A) LENGTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 83: 





GAGCTGTTGT 


TACTTTGATG 


C CTG CAG CTT 


TATTACGGCT 


GACTTGGTAA 


TGATAAGTTT 


60 


20 


CAG CATATTG 


CTCAATATAT 


GCTATATCAT 


ATTGAATGGT 


ACGAGGTGAT 


ACACCAAGTT 


120 




GATTAGCAAT 


GGTATTGATT 


GGAATAAACG 


TTTG CTCATG 


AATTAAAAGA 


TACAAAATTT 






CGATTTGTCT 


ATAACTTAAC 


AACGTAATAT 


CCTCCTATTT 


GTAATTGTAA 


GCGATTTCTT 






AAAAACGTAG 


ATATGCAATC 


TCTTTCATAT 


TTTAATCCGA 


AAAATTGCAT 


ATCAAAATGT 






TTATGGCGCA 


AGATTTTATA 


GGAACTTTTA 


AAATAAATTA 


rATATTCATG 


TTGACAATTT 


360 




AAAAATGTCG 


CAGTATATTT 


AGTTAGACAT 


CTAACGAAAT 


GGTGGTGCAA 


TAAATGGAAT 


420 


30 


TCACTTATTC 


GTATTTATTT 


AGAATGATTA 


GTCATGAGAT 


GAAACAAAAG 


GCTGATCAAA 


480 




AGTTAGAGCA 


ATTTGATATT 


ACAAATGAGC 


AAGG TCATAC 


GTTAGGTTAT 


CTTTATGCAC 


540 




ATCAACAAGA 


TGGACTGACA 


CAAAATGATa 


TTGcTAAAGC 


ATTACAACGA 


ACAGGTCCAA 


600 


35 


CTGTCAGTAA 


TTTATTAAGG 


AACCTTGAAC 


GTAAAAAGCT 


GATCTATCGC 


TATGTCGATG 


660 




CACAAGATAC 


GAGAAGAAAG 


AATATAGGGC 


TGACTACCTC 


TGGGATTAAA 


CTCGTAGAAG 


720 


40 


CATTCACTTC 


GATATTTGAT 


GAAATGGAAC 


AAACACTCGT 


ATCGCAGTTA 


TCTGAAGAAG 


780 




AAAATGAACA 


AATGAAAGCA 


AACTTAACTA 


AAATGTTATC 


TAGTTTACAA 


TAAATGATAA 


840 




GTGTGACTGG 


TAGAAATCAG 


TCACTTTGTC 


TTTAATATTA 


TAGTTAGATA 


TCTAATTGTT 


900 


45 


AGTAAG CTAA 


TTATTGGAAA 


AGACAAGGAG 


TATTGAACAA TGAAAGACGA ACAATTATAT 


960 




TATTTTGAGA AATCGCCAGT 


ATTTAAAGCG 


ATGATGCATT 


TCTCATTGCC AATGATGATA 


1020 




GGGACTTTAT 


TAAGCGTTAT 


TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 


1080 


50 


GAyAGCCACA TGATTTCTGC 


tAatCTCTCT 


AACACTGCCA 


GTATTTGCTA 


TCTTAATGGG 


1140 




GTTA 












1144 



55 



1410 



EP 0 786 519 A2 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1158 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 484: 

10 

ACAACCAATT TTACTAAACA TGGTrrTCAT gCATTTTCAA TATTTAGATG TCGACCGCAT 60 

AATCGAAGAA TCGCCGACAA TAGTACTTAT CGATGAGTTA GCACATACGA ATATTTCTAG 120 

^ AGATCGTCAT GAGAAACGAT ATATGGATAT TGAAGAAATT TTAAATCATG GTATCGATGT 180 

TCATACCACT TTGaACATTC aaCATATTGA AAGTTTAAGT AGTCAAATTG AACTGATGAC 24 0 

CGGTGTACAT GTTAAAGAAC GTGTACCCGA CTATTT CATA ATGAGCGCCG ATGTATTAGA 300 

20 AGTCGTAGAT ATCTCACCTG AACAATTAAT TAAACG CTTA AAAGCTGGCA AGGTATATaA 360 

AAAGGATAGG CTAGATGTAG CATTTAGTAA TTTCTTTACG TATGCCCACC TAAGCGAAtG 4 20 

CGTACATTGA CGTTAAGAAC AGTTGCCGAC TTGATGAGTG ATAAAGAAAA AGTCCGACAC 4 80 

25 AACCATAAAA CGTCACTCAA ACCTCATATT GCTGTGGCAA TTAGTGGGAG CATTTATAAT 540 

GAAGCAG TAA TTAAAGAGGC ATTCCATATT GCTCAAAAAG AACATG CGAa GTTCACTGCT 6 00 

ATTTATATAG ATGTATTCGA AAAAAACAGG CAATATAAAG ATAGTCAAAA GCAAGTGCAT 6 60 

30 CAACATCTCA TGCTTGCAAA ATCATTAGGA GCAAAAGTAA AAGTAGTTTA TAGCCAAACC 7 20 

GTTGCATTAG GATTAGACGA ATGGTG TAAA AATCAAGATG TAACCAAATT AATTATCGGA 7 80 

CAACATATTA GAAATAAGTG GCGAGACTTT TTCAATACAC CTTTAATTGA CCATTTAATG 840 

TCCTTTGAAC ATAGCTATAA AATCGAAATC GTTCCAATCA AACAAATACC TGTTGAATTG 900 

AAAATGAACA AATCACCCTA TCGTCCTAAA GGCAAACGTT TCGCCATAGA TATGTTAAAA 96 0 

4o ATGATTTTGA TTCAAATAAT TTGTGTAATG ATGGGACTGT GGATTTATCA ACTTGATAAG 1020 

CATGAGTCTA GTACGATTAT TTTAATGATT TTTCTCATCG GCATCATTTT ATTATCCATT 10 80 

TGGACGCGGT CCTTCATCAT TGGCTTTTaG CAGCAATTAt TAACGTATTT GTgTkTAATT 114 0 

45 ATkTTTTtAC GGAACCTA 1158 

(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2224 base pairs 
50 (B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA CGTTTAAACA 60 

5 TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 12 0 

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 24 0 

W TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 300 

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 36 0 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 420 

15 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 4 80 

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 54 0 

ATTACGGATG AATTAGGTG C TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 600 

20 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 660 

GGAGATGATT TAGTTGTTAA AG AAG CAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

2S TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 780 

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 840 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 900 

30 AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960 

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 1020 

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGc TAAAG CAACa 1080 

55 aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGTG 1140 

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 1200 

GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 1260 

40 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 132 0 

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 1380 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440 

45 

ATCACTTATA ACGATTTAGA TG AAG CG ATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 156 0 

so GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620 

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680 
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AGTGCACATG ACTAATTAAG TTTTG TGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 18 00 

TTTTGTAATT AAAATAAAAA TTTG CTAT AG TTATTCATGT ATTTAAAAGG TTGGGGATTA I86 0 

5 GCATAATGGG ATTGTG CTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920 

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980 

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 204 0 

"> GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100 

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 216 0 

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220 

IS 

ATTT 2224 



(2) INFORMATION FOR SEQ ID NO: 4 86: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 60 

30 CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 12 0 

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 180 

TAATGCACCT ATAAT GAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 24 0 

35 TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300 

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 3 60 

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420 

40 TCGTCTAGTG AATAAGTTAA AG CCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 4 80 

GGAATAATAA TGTTG CTTAA TAATTG TGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 54 0 

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 6 00 

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 6 60 

CCAATCGTTG TAAACATAAT TAAGAAT GAT GCACCTATGA CAACGTCGGA TGATACCATC 72 0 

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 780 

SO 

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 64 0 

55 
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AATGTAAAAT GTTCAAAGTG AATCATATTA CCAGCCGAAT TGAATGAATA GAACATTAAA 960 

AAGAATATTG GGATGTATAA AATCGCTAAA AGTATCCCGA TATACAG CTT TCCATACCAT 102 0 

TTCATATGAT TCACCCTCTC CCATTAGATG ATTTTGTAAT GATTAAAATG AATGCCATAA 10 8 0 

ATACAATTAA GAATATAGCT ATAGTTGATC CCATACCATA ATTTTGAATT GTTAAAAATT 114 0 

GTTCCTCTAT TGCCGTACCT ATATTTATGA CTTTATTACC TGCAATTAAT CTTGTAATCA 1200 

TAAATAATGA AAGTGATGGA ATAAAGGTTA CTTGAATCCC AGTCATAACA CCTTCTTTTG 1260 

TTAACGGCAT GATTACTTTT CTAAAAGTAT AGAAAGGACT GGCACCTAAA TCACTTGAGG 13 20 

CCTGCAATAA ATTATTAGGA ATTGCTTTCA TGCTATTAAA TATAGGTAAA ATCATAAATG 13 80 

GTATATAAAT GTAACTTGCC ACTACTAAAA ACGCACCAGT TGTAAATAAC AAATTGAATG 1440 

ATGGTAAATT AAATAAGTGG AAAATTGATT AATCACGCCA TCATGACTTA ATAAACCTAT 1500 

AAAAGCATAT GTCTTTAACA ATAAATTTAT CCATGTTGGA ATAATCATTA TCATTAATAA 1560 

GATATTTTGA AATTTCGAAC GAGTAATATA ATAGGCAGnT GGATAACTGA TAGTCAAGGT 1620 

AATAATTGTT ATTGAAGCGG CATATAAAAT TGAATATGCA AACATTTTCA AATATTTTGT 1680 

AGTAAAAATT 1690 
(2) INFORMATION FOR SEQ ID NO: 4 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 487: 

ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 60 

TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATGGTTATT 120 

CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 180 

TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 24 0 

ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 3 00 

TGTT CAT ATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAA CT ATGTTATAAA 360 

TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTG CC GAGGTTTTCT 420 

TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 48 0 

AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA 54 0 



1414 



EP 0 786 519 A2 



TATTATTATG 


TTCAAAACTT 


TACGCTCCAA AAAGTAAAAA GGAAGTTAAG 






TTGCTTAaCT 


TCGGaTATTG 


AACGCATCAG TCCAATTTGA CATAGAGCCT 






TTGATGTTTC 


TCTTTAAAAC 


CTTGCATATT TTACAAAAAG AAAAATTAGC 


AGTATAATTA 




AGACAACGAA 


AATAAGTATT 


TACTTATACA CCAATCCCCT CACTATTTGC 


GGTAGTGAGG 




GGATTTTTAT 


TGGTGCGGCT 


ATATGTCACC TATTTTGTAT TGCGTCTACT 


TAGCCAATAA 




GAAAAAAACG 


CAATGGCACA 


GCCACTGATG ACTGGTGCTA TGATGTGAAC 


GaAAATAAGC 




ATCACCTTAT 


ACACCTCCTC 


TCTGCGTCTA AATTGACGsC TGAGaG rTAG GcGACTCTAC 




TATTATATCA 


TCGGCAAATA 


TACAAGCACA GTCACTTGCT TCTGATAAGT 


TATATGATTC 




TAGCTGATAG 


ATTGAAT CGT 


CTACACTTAA TTGGACAAAT TCTATGAGAA 


TAGATATTGT 








TTTTATTATG ACAAGAGAAA GAAGATCATT 


TAGTTCAGAG 






AAATGG TT AG 


ATTATATAAA AATGGTAAGC CTAGGAATGA 


AATTATACGC 




GAGTATGATT 


TC ACAC CTTC 


GACGTTTGTA AATGGCGGTT ATAAAATGTA 


GGAAAATGGA 




TAAAG CAACA 


TCAAAACACG 


GGTACATTCA ATCACCAAGA TAACTTATCG 


GATGAAGAAA 




AAGAGCTGAT 


TAAATTACG C 


AAAGAAGTTC AACATTTAAA AATGGAGAAC 


GATATTTTAA 




AGCAAGTAGC 


GCTGATTATG 


GGGCAAAAAT AGAAGTCATT CAAAAGAATG 


CACATCAATA 




TTCAGTATCA 


GCAATGTGTA 


AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 




AAAAAGAAAA 


GATAATAAAA 


TCACTAAAGA TGATTCAAAC ATAGAACATG 


CCGTCATAAA 




TATTTTTAAT 


TCTAATAGAA 


AAGTCTTTGG TACAAGACGA ATTAAAAATC 


ATTTAAATGA 




CAAGGGTCTC 


ACTGTATCTG 


GACAAAAGAT AGGT CGATCA TGAAAAAATC 


TAGTTTCTGT 




TTATACGAAA 


TCTAAATACA 


AAAATCATCT AAAAGAAACT AATGAAAAAC 


GAATTAAAAA 




TCTTTATTAT 


TAGCTGCTGG 


TGTATTATTT GTTAGTCCAA TTTCATTATC 


TTATAATTCA 




GATGTAGCTC ATG CTGAAGA 


TAAGTTAGAC CATTCTCAAG CAAAGGTAAT 


ATATTTGAGT 


1920 


AACCAAAATT 


TATTTGATGA 


ACTTGAGAAA AAAGGTTATA AACTGGAAGA 


TATATTTACA 


1980 


AAAGAAGAAA 


TAAAAAAATA 


TAAAG CTGAA GACCAATTGA GAGCGGGTAA 


AACTCAATAT 


2040 


GTAGAAACAG 


GTAAAGATAC 


TGCAACATTA TATCTTTCTT CTGCATATAC 


AAAAACAATA 


2100 


GCTGCTTTAG 


GT 






2112 


(2) INFORMATION FOR SEQ ID NO : 4 88: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

s GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAG TGT AGAG 6 0 

TTATAGTCTC CTTCTTGAAT CTCGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 12 0 

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 18 0 

w GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 24 0 

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 3 00 

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 360 

/5 CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 4 20 

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 4 54 



(2) INFORMATION FOR SEQ ID NO: 48 9: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 
TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 60 

30 

TTTTGACTTT TATTTATTTC TTTCCACTTA CCAGTATGAC TTTCTTTTTT TACAGTTATT 12 0 

TTCGGTTTGT TTAAAAAATG ATAACCGATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 180 
ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 24 0 

35 

TACCCATTCG CTTTGCGATT TTCAATCGTT GTAACTGGAT TCTTTGATGA ATCAGTA CTT 3 00 

TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TTTTTTGGCA 3 60 

4Q GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTC CGATAC TAGCATGTTG GTCATCAACT 42 0 

TTTGTTCCGC CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 4 80 

TTGTCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 54 0 

45 TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 6 00 

TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 66 0 

CCAAATG CAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC GATCCATGTC ATTATATATT 72 0 

50 TTAAG TTGTT GTGTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 78 0 

55 
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ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AATCATCCAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

CTGATAGCTC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 102 0 

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 10 80 

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACG CC ATAAGCACCA 114 0 

GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCAT CTT TATAAAATCC ATTACGTTCT 12 0 0 

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 126 0 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320 

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 13 72 
(2) INFORMATION FOR SEQ ID NO: 490: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490: 

ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 60 

AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 12 0 

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATT CT AGGAAAATGC TTTTTTCTTT 180 

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTGCATTG TACTGTTGTG 24 0 

CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 3 00 

CGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATAGCCA TTAAATCCAT 360 

CTTGaATCAG TTCTGaTGGA CCATAATCTA CATCATAACT GATCACTGGT GTACCTACTG 4 20 

AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 4 80 

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 54 0 

GATATnATGA TAATCTTCTA CAAG 564 
(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1277 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 





TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA 


GCAGAAACGA 






ACAACAGTTA ACAAAATAAA TGAAATTAAA CGTTTTAAAA ATGAAACAAA TGAAATCATC 






TATTAGGTTA TGAAACTGTT TATAGCTTGA AT AGAAG CAT TTATTTTTTA 


GGAGGACAAT 






TATTATGCGT CAAACATTTA TGG CAAATG A ATCAAACATT GAGCGCAAAT 


GGTATGTTAT 




10 


CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA 


TCTTACGCGG 






TAAAAATAAA GTAACTTACA CACCACACGT TGATACTGGT GATTATGTAA TCGTTATTAA 




15 


TGCATCAAAA AT CGAATTTA CTGGTAACAA AGAAACTGAC AAAGTTTACT 


ACCGTCACTC 




AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA 


CTAAC CCAGA 






ACGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG 


GCGAAAAACA 




20 


AGGTAAAAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC 


AACAACCAGA 






AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTTGGCACAA 


GTTGAATATA 






GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa 


GGTGAAGGTA 




25 


ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA 


TTAATTTTAG 






ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT 


TTAGTTAACG 






TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG 


CTCGTGCATT 






ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC 


TTACTCGTGA 






CCCACGTATG AAAGAACATA AAAAACCAGG TCTTAAAGCA GCTCGTCGTT 


CACCTCAATT 






CTCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATATTATG 


TCGAGGTGTT 




35 


TTTTTGGCGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT 


TTTACCTTCC 


1140 




CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGTTGGA 


TATTCTGTTA 


1200 




TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTTTAAA 


1260 


40 


AAAACAACAA AnAGAAA 




1277 




(2) INFORMATION FOR SEQ ID NO: 4 92: 






45 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






50 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 







55 
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TTATTGACAT TGTTTTTATC CAAAATTCAT TGTTAAGACA TTTTCTTTAT GAAATAATAA 12 0 

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 1B0 

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 24 0 

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 3 00 

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 3 60 

TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 42 0 

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 480 

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 54 0 

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 600 

aTCCAACATT TCCAGACCAA ATCACACCGG kAAACGT CCA AATATATTGn TCTATGCCCG 660 

GTTTACCAGT ATC 673 
(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 6 0 

TGAAGTTATT AACG CTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAA CGA 12 0 

CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180 

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 24 0 

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300 

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGG CGA 360 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 4 80 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 54 0 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 60 0 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 66 0 

GAACCATTTA AAGATGG TAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720 
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TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAG TACAAA TAAAATCTAC CAAACTAGGT 84 0 

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 96 0 

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 1020 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 1080 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 1140 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCmA ACATTAAaGA TGTAAAAltl CG 1200 

aTTGGAACGa TTGGCGATtC TGTAGCTAGA GGATCACATG 1240 



IS 

(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 



ACGGTGGATT 


TAG CCTAGAA 


GATTTAACGC 


ATCAAGGTAa 


ATTaTCAGCA 


TTTAGCTTTA 


60 


ATGATCAAaC 


AGGTCAAGCA 


ACATTGATTA 


CTAATGAAGA 


TGAAAACTTC 


GTAAAAGATG 


120 


AGCAACGTGC 


TGGCGTAGAT 


GCAAATTATT 


ACGCTAAACA 


AACATATGAT 


TATTACAAAG 


180 


ACACATTTGG 


TCGTGAATCA 


TATGaCAACC 


AAGGTAGTCC 


AATTGTTTCA 


TTAACGCATG 


240 


TTAATAACTA 


CGGTGGTCAA 


GATAACAGAA ATAATGCCGC 


ATGGATCGGT 


GACAAAATGA 


300 


TCTATGGTGA 


TGGTGATGGT 


CGCACATTCA 


CAAGTTTATC 


GGGTGCAAAT 


GACGTAGTAG 


360 


CACACGAATT 


aACACACGGT 


GTGACACAAG AGACAGCGAA 


CTTAGAATAT 


AAGGACCAGT 


420 


CAGGCGCTCT 


AAATGAAAGC 


TTTTCAGATG 


TTTTTGGATA 


CTTTGTAGAT 


GACGAGGATT 


480 


TCTTAATGGG 


TGAAGATGTC 


TACACACCTG 


GAAAAGAGGG 


AGACGCTTTA 


CGCAGCATGT 


540 


CAAACCCAGA 


ACAATTTGGT 


CAACCAGCTC 


ATATGAAAGA 


CTATGTATTC 


ACTGAAAAAG 


600 


ATAATGGTGG 


CGTACATACG 


AATTCTGGAA 


TTCCAAATAA 


AGCAGCTTAT 


AACGTGATTC 


660 


AAGCAATAGG 


GAAATCTAAA 


TCAGAACAAA 


TTTACTACCG 


AGCATTAACG 


GAATACTTAA 


720 


CAAGTAATTC 


AAACTTCAAA 


GATTGTAAAG 


ATGCATTATA 


CCAAGCGGCT 


AAAGATTTAT 


780 


ATGACgAGCA AACAGCTGAA 


CAGGTGTATG 


AAGCATGGAA 


TGAAGTAGGC 


GTGGAGTAAA 


840 


AATATATAAA 


CaAGAAGAAG 


TAATGTTAAA 


CACTTATAAA 


TAATTAAATT 


TTAAATACAG 


900 



55 
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ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 102 0 

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 10 8 0 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140 

TAAATGAaAC ATGCATGAAT TTATTTTTT A ATACAAGAAA CGTAACTACC AAAGGAGTTT 1200 

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 12 6 0 

TTTATAAAAA AGAACAAAT c ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 4 95: 

(i) SEQUENCE CHARACTERISTICS: 
rs (A) LENGTH: 1761 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 95: 
TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60 

2S TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 12 0 

ATCAAACCAT TTTG CCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180 
AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 24 0 

30 TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 3 00 

AACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 3 60 

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420 

35 CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 4 80 

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 54 0 

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 600 

40 TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 6 60 

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 72 0 

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 78 0 

45 

CTCCCCTAGT AGTTCCTTAG TATT CAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 84 0 

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 90 0 

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 96 0 

so 

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATG CAGTA ACCCAATGTC 102 0 

55 
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TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTGTAAGA 114 0 

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 1200 

TCACGTTGrG CAATTTGTTT TAGATGAGC a TTCACATTTA CTGATGAACC TTCTGATGAT 126 0 

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTGGTTGT 132 0 

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 1380 

GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 144 0 

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 150 0 

CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT 1560 

TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 162 0 

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 1680 

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 174 0 

AGTTTTACAG TGGACGACTG T 1761 

(2) INFORMATION FOR SEQ ID NO: 4 96: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA 60 

GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG 120 

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180 

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 240 

ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 3 00 

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 360 

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 4 20 

AAGTG CAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATG CAA 4 80 

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 54 0 

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 600 

AAGATGTTAG AATCGAAGCG CCAATACCTG GTCGTTCTGC AGTAGGTATT GAAGTGCCAA 660 
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ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 78 0 

ATGAAATG CC ACAC 794 
(2) INFORMATION FOR SEQ ID NO: 4 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497: 

AG CCAGTTTT GcATTTCGTC AAATCGCAAT GAATATATTG ATTGCTTGTC AAAATTTGGA 60 

AGAAATTTAT TCTTCTTTTT CTTTTCAGCA GTTATCATTC CTAATTGTC C TTTCTTTTTA 12 0 

TCTTAGTGAT AAAGAATCCA TCTGAATTAA AGTCTTGCGG CATGATTTGT AACGTTTTGA 180 

CCAACTCTCC AGTTATCGGA TGTTGAAACG GTTCAAATTC GAAGTTTTTA TTATTTTTCA 240 

AAAACGTATA AATCACGTTT TCATTTTCTA GTTGCTCAAT TGTACATGTT GAATAGATGA 300 

TTTCTCCACC TATTTTTACA TTGTTTTTTA CATTTTCCAA TATTTCAAGC TGTAATTCAA 360 

CTAGTGACTC AATATGTTGT TTG CTTTGAG TATACTTAAT CTCCGGCTTA TGTCTCATTA 420 

CACCTAATCC GCTACATGGT GCATCAACAA GTATCTTATC GTATGTTTTA TCATAAGGTT 480 

TTGTCGCATC ATGTTGAAAA GCTTTAATAT TTGTTAATCG TAATTnTTTT ATATTAAAAT 540 

TAATTAAGTC TATTTTGTGA TCATGTATAT CTGAAGCGTC AACTTGC CCT TCTGGCATTA 600 

AAACTTCAGC AATGTGACAA GCTTTACCGC CAGGTGCACT ACATGCATCT AATACGTGAT 660 

CATGTCGGTC TACATTCATA ATGTGTGCAa CAAACATTGA GCTTTTATCT TGAATTGAAA 720 

CGAATCCATC TTTAAATGAA CGAGAATGAA TAATTGGTTG TCCTCCTATA TGGAGACAAT 780 

AAGGTAAGTC ATGATCTTTT TCAACGTCAT AACCTTCGTC TTGCAACTTT TCAATAATAT 840 

CATCTAATGA TGCTCGCGTC AGGTTGGCAC GCACAGTTGT TGATGTCGTT TCTAAAAATG 900 

ACTGTAAAAT TTTTTCAGTT TCTTCGAGAC CATAATGTGT TGCCCAATGA TCTATAATCC 960 

ACTTCGGCAT ACTATACTCG ATTGCCATTC TTTTTTTAGG ATCTGCAATT TCATTAAAAT 1020 

CAGGTAAGTC ACTACGCATC ATTGTACGTA AAATACCATT TACGACATTA CCATTATGAT 1080 

AGCCACCGCG TTCTTTTGCT ATTTCAACTG CTTCATTAAT AATGGCATGA TTTGGAACTT 114 0 

TATCTAnATA nACATATTGA T 1161 
(2) INFORMATION FOR SEQ ID NO: 4 98: 
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(A) LENGTH: 1504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 98: 



10 


AGCTCACGTC 


ATCTTCGGCG 


GCGCTAAATT 


AAAATAATCA 


ATTTCTGAGT 


TAAACTTTTA 


60 




TTTACAACAT 


ACTATTACTA 


TACATTACAA 


ATTTTTAAAA 


TATACATTAC 


ACTCATTACT 


120 




CAATGGmAAG 


CGTATGATTT 


CmCAGCCCCC 


CTAGCTTGTA 


GAAATCATAC 


TTTCCTTTTT 


180 


15 


TCAATATATA 


TACAACTATT 


AAATCCCATA 


AGATTGCAGA 


GCACATAAGT 


AAATTTTTTT 


240 




AGAGCTTGAG 


GTTTGTTTAG 


CTTAAGCAAC 


CCATGAGCTC 


AAACACTTCC 


TGTTCACATA 


300 




ACACTACAAA 


TCGCATTATG 


TTGCTTAATC 


TTATGTTTAT 


ATAAATTACA 


CACAATAAAT 


3G0 


20 


AGAAAGAATG 


TGAACATCAT 


GAATAAATTA 


TTGCTACTCG 


TTACATTTAT 


CATT CGTGTG 


420 




GGTTCAGGTA 


TTGTTATGTT 


AATGCAAGGC 


TACGAAAAAT 


TAACGGGCGG 


ATTTACGCTG 


480 


25 


AAAGGTTTAG 


TACCAGTCAT 


CGCTAACAAT 


ACTGATTCAC 


CAGAGTGGTA 


TAAGTGGTTT 


540 


TTCGCAAATA 


TAGTTGCACA 


TACGACGTCA 


TTATTTGATA 


TTGTTGTCCC 


ACTCGGAGAG 


600 




ATTGCAATTG 


GATTAGGTTT 


AATTTTTGGA 


GTTTTTGCAT 


ATGCTGCTAG 


TTTCTTTGGA 


660 


30 


GCCTTTGTTA 


TGATAAATTA 


TATCTTAGCA 


GATATGATAT 


TTACGTATCC 


TCTTCAATTA 


720 




ACTTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 


780 




ATCATTAATT 


ACTTTAGAGG 


TCGTAAGAAC 


AGAGGTGAAA 


AAATAGATGA 


CCCACTTACT 


840 




GATCGTGGAT 


GATGAACAAG 


ACATTGTAGA 


CATTTGTCAA 


ACCTATTTTG 


AATATGAAGG 


900 




TTACAAAGTA 


ACAACGACAA 


CTAGCGGTAA 


AGAAGCAATT 


TCTTTACTAT 


CAAATGATAT 


960 




TGATATCATG 


GTACTTGATA 


TCATGATGCC AGAAGTTAAT 


GGTTACGACA 


TTGTCAAAGA 


1020 


40 


AATGAAAAGG 


CAAAAATTAG 


ATATCCCCTT 


TATCTATTTA 


ACTGCCAAAA 


CACAAGAACA 


1080 




TGATACCATT 


TACGCCTTAA 


CTTTAGGTGC 


AGATGACTAT 


GTCAAAAAAC 


CATTTAGTCC 


1140 


45 


AAGGGAACTC 


GTTTTA CGTA 


TTAATAATTT 


ACTTACAAGA 


ATGAAGAAAT 


ACCATCATCA 


1200 




ACCAGTTGAA 


CAACTGTCGT 


TTGATGAATT 


AACACTTATT 


AACTTAAGTA AAGTtGTGaC 


1260 




t GTAAa TGGT 


CACGAaGTCC 


CTATGCGTAT 


TAAGGAATTT 


GAGTTATTGT 


GGTATTTAGC 


1320 


50 


TTCTAGAGAA 


AATGAAGTTA 


TTTCTAAATC 


AGAATTACTT 


GAAAAAGTTT 


GGGGATATGA 


1380 




CTATTACGAA 


GATGCTAATA 


CCGTGAATGT 


CCATATACAC 


CGTATTAGAG AnAAATTAGA 


1440 




AAAAGAGAGC 


TTTACAACAT 


ATACCATCAC 


AACTGTATGG 


GGATTAGGAT 


ATAAATnTGA 


1500 
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(2) INFORMATION FOR SEQ ID NO: 4 99: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 99: 

ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 60 

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 120 

15 

aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AAT CT CTGAC GAAGATATGA 180 

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 240 

20 ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 3 00 

TGTTTATAAC AACGAAGTTA TGGAATGACT ATCAAGGTTA TGAGAAAACA TTCGAATATT 3 60 

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 4 20 

25 CTTGTGAAGC AGATGGT CTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 4 80 

AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 54 0 

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 6 00 

30 AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 660 

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 720 

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 7 80 

35 

CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 84 0 

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900 

4Q CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 96 0 

GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 1020 

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 10 80 

45 CATTCCTTGT TTTAATGAAG GGGAAGT CAT TACACAAACA CATCAACAAT TAACTGAAAT 114 0 

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 1200 

TAGTACGGAT ACCACTaTAG ATGAAATGCA ACATCTIGCC ACAATAGATA GGCATGTCAG 12 6 0 

50 

CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 132 0 

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGT GAT TTGCAACATC CACCTGAATA 13 80 

55 
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TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 150 0 

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 156 0 

AAGAG CTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 162 0 

TGA 1623 
(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 6 0 

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120 

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 18 0 

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 24 0 

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 300 

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 36 0 

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 420 

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 4 80 

GAAAGAAGTA TTAAT CAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 54 0 

ACAGCAATTT CTGCATACAA CTTAGG t TAT AAAATTTCAG TAC CTGCTGA GGGAGTGGCT 60 0 

CATTT 605 
(2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 501: 

TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 6 0 

GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 12 0 
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TGAGAGAAAA GTTTAGACAC TATTGTGAAC ATGAAGTTCT AAGTTCAATC ATCAATGGTT 24 0 

CATACATTAT CGTCAAAACC TCACCTGGTT TCGCCCAAGG CATAAACTAT TTTATCGATC 300 

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGACACTACA TTAATCTTAA 360 

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG A CATG TAT C A 42 0 

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 480 

TTGTGTACGC GTTTTTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 54 0 

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 600 

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 66 0 

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGT CTATAC 72 0 

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 780 

AATAGCGAAA TTGGAGCTTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 84 0 

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 900 

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960 

TACCTTGAGA AGTTAGCAGC TGAAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 102 0 

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 1080 

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACGTAAG 114 0 

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 120 0 

TATTTAGATC CAATGCCAAA CCTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 126 0 

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACGACGAC GAGAATCAAT ATTTATTCAA 13 20 

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 1380 

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 14 4 0 

ATAGGCGTTT CAGAACGTAC AT CTGCACAA G CTATTGAAA AGTTAGCGCG ACGTATTTTT 1500 

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTC CAAC TAGTCGAACT 1560 

TTTATGCACT TAGATACAGT ATTTACAATG ATAGATTATG ACAAATTTAC AATGCATTCA 162 0 

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 1680 

AAAGATATTG C CATC AAA CA ATCTAGTCnT TTAAAAGATA CTTTAGAAGA CGTACTAGG 1739 
12) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1745 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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60 
120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 
CTGTACATAC AGCAATATCG TTAACAAATG AAAACAGTAT TTTAGGATTG TAAACCATGA 
TAAACCTAAA ATACTGTTAT TTTTATTACT TAAATTTCTT CTTCAATGCC TTTTCAACAT 

AAGGTGGAAC GAATTCAGAA ATATCTGCTC GATAAGCTGC AACTTCTTTA ACAATACTTG 18 0 

AACTTATAAA TGAATAATTA GTACTAGACA TCATATATAA CGTTTCAATT TCATTGTTCA 24 0 

ACTTTTTATT CATTGAAGTT AAGCGTAATT CATATTCAAA ATCACTGACT GCTCTTAAAC 300 

CACGTATGAT TGTTTTAGCT CCTACTTGTT CACAATAATC GACTAGTAAA CCACTAAATT 36 0 

GATGAACCTT GACATTAGGT AAATGTTTAA CAGATTGTTC AATTAAATCC ATACGCTCTT 420 

CTAAACTAAA CGTACCTTCT TTTTTACTAT TTTTAAGAAC ACAGACATGA ATTTCATCAA 48 0 

ATCTATCTGT ACTTCTCTCA ATAATGT CTA AATGACCATA AGTAATGGGG TCAAAACTAC 54 0 

CCGGAATGAC CGCTATTGTA TGTTCCATGC TATTCTCCCT TTT CTAATAA CAATGTGTCT 600 

GTCAACCCAT AATGGTAACG TTT AAT CAT A TTAAACGGTT GATAATCtAT TTCTTCATGA 660 

TTGCTAAATT CACAAACGAT GATACCATTT TCTTTCAATA AATTAAACTC TGAAATTAGT 720 

TTTAAAGCTT TATCAATGAG ACCTTTATTA TAAGGTGGAT CTAaGAAAAT GACATCAAAT 7 80 

TGAATATCAC GTTTTGACAA TGCTTTTAAA GCTCTATCTG CATTATTTTT ATAAACTTCA 84 0 

GATTGTGCCT CTAAATCCAA ATTCGCAAGA TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 9 00 

TTTTGaTCAA CAAAGATTAC CTTATCCATA CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 9 60 

CCGCTTCCTG CAAATAAATC TAAACCTATA CCTGACACAT CATATAAACT ATTAAAGATA 10 2 0 

CCTTCTTTAA CTTTATCCAT AGTTGGTCTC GTATTACGGC CTTCCATACT TTCTAAAGCT 10 80 

TTACTTTTAT GTTTACCTGC AATGACGCGC ATGTTGTTCA CACTTCCAAT TCATTTAGTT 114 0 

ATTTAATATA ATTTATTGAG AAAAAGGAGA ATGATAAACC AATGAAACAA ACATTTATTA 12 00 

CACTTGGTGA AGGTCTAACA GATTTGTTCG AATTCATGAC GATGATTGAA TATAACCATC 126 0 

AACGTATTGA TAAAATTATC TATTTTCATT CACCACAAGC TGAAAATAAA AAGTCATCTG 132 0 

TAGCAATCAT TATGAACCCT ACAACTGGCA ATCATTTCCA AGCATTTTAT ATCATGATAA 138 0 

ACGCTATTAA ATATCCATAT CCAGATTCAA ATAAAAAGTT TCAAATGATA AATGATTGTG 144 0 

CTGAAAAATT CGACATACCA ATTTTAGGTA TCGATGTACA GCCCCCTCAA GCATTTCATG 150 0 

ATTTATCGTT ATATTATAAT TATTTAATTA GTGTGTTAAG GCTCCAAAAA TGGATACCAG 156 0 

aACTTCAATA ATATTAATTA TATATTTCGT GTTTCTCTTT TTCGTAAGTT TTCTTTAAGT 162 0 
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TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATT TGAAT 1740 
GAACG 1745 
(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1035 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 

TCGTCTTTAA TCTTGCTGAC TTTAGAAGGC TTACGAGTGC GGTTACCATT TTTTGCATCT 60 

TTAACTGATT GAACTAAAGC Tt GACGTGTA GATTTATCAG CTAAACTAAT TGCACCACCA 120 

ATTACGGCAC CAATTAAAAT ACCAGGAACA AATTTATTTT CCATAAAAAA CTACCCCTCT 180 

TTCAAATTTG CATCTTTTAC GATGTAGTCT ATTAAATTAT CACAAGATGA TAATACCATG 24 0 

TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 3 00 

TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 3 60 

TTATCAACGT TACTTTGATC CATAGC CACA ATGTAATCAA AATCATCTGT CGCTT CGAAT 4 20 

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 4 80 

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTAATATCA 54 0 

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 6 00 

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660 

ATATCCCCAT TTTAT AG CG A CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720 

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 7 80 

TATGACAAAT GAAGAGAAAG TnTTAGCTAT TAGAGAGAAG TTAAATATtG TTAAT CAAGG 84 0 

ATTATTAGAT CCTGAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900 

TTTTGkTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 96 0 

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020 

CAATACGCTG AACTA 10 3 5 
(2) INFORMATION FOR SEQ ID NO: 504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12B4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 

5 AnCCTGACAA GATT CTTATA TTCATAGATA TGAGAG CTAA AAATGAAATC AACAAACAnA 6 0 

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 12 0 

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATG TATTTT 18 0 

to 

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 24 0 

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 3 00 

1S TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360 

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 42 0 

AATGACATTT TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGT CCTAAA 48 0 

20 TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 54 0 

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTGCATA GCCTACGAAT 600 

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 66 0 

25 TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 72 0 

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780 

CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 84 0 

30 

ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 90 0 

GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATG C CTAT AATCTCTCTA 96 0 

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020 

35 

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 1080 

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 114 0 

40 TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 120 0 

TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTG CGAATT 1260 

TTGGnGTCCA nCGCTTCATT TGAA 1284 

45 (2) INFORMATION FOR SEQ ID NO: 50 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 
SO (O STRANDEDNESS : double 

(D) TOPOLOGY: linear 

55 
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ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 24 0 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 3 00 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 360 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 42 0 

TAAAACTGTT CTT CTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 4 80 

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 540 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 600 

GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660 

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 780 

AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 84 0 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 90 0 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 114 0 

TTCATCATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 12 00 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

TATCACCTCT TTAATGT CAT ATTATTCATA ATATATCAAA TATCCGACAA AACGCCAATA 132 0 

AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 1380 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 144 0 

A TACT AAA CA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500 

GCAGACCTTA TCGTGCAAGT CAAATGTATT AT CATG AATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATT CAAGG ATTACAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 174 0 
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AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 186 0 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG ACG CTGGGGC GCACCATTCC 1920 

5 

CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 1980 

AATTTTTAAA TATTGACCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2 04 0 

w AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 

ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG GATTGGTCGA TATTTGAATG 2160 

ATT T C'TTTAA tttAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220 

15 TTTAAAACAA TAGGAGGTAA GTTACGATGC C CAAAATAGT CGTAGTCGGA GCAGTCGCTG 22 80 

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 2 340 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 2400 

20 TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460 

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2520 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2580 

25 

GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 2640 

ATTTAGAAGA CACTGATGCT ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700 

TGGTTGTAGG TGCAGGTTAT GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 2760 

TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2 820 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2880 

3S TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2 94 0 

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000 

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3060 

40 TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120 

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTG CTGGAA 3180 

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 3 24 0 

45 CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300 

AAGTCACTCA AGGTGCACAC GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 336 0 

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 342 0 

50 

CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 348 0 

TAACTGAGTT TGAAGTGGCT TATGCACCAC CAT AT AG CCA CCCTAAAGAT TTAATCAATA 3 54 0 

55 
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GTTAGAATTA TGTTGGACTG GTACTACTAT CCAGTCCATT TTTTATGTTT AACATTTTTA 36 6 0 

GAATCAAAAA AGACATAAGG TCTTGGACTA ATAATTGTCC ATGCCTTATG TCATATACTA 37 2 0 

5 

TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT AGGsCCTAAC GGTAGAATGA 37 8 0 

CACCTAATAC CATTGTGATG ATAATTAATG CAATTGTTAT CCAAAACATT GTGTGACTTT 384 0 

^ GTTCATGTCT CTTTCTTTTA GCAATCGACA CTTCCATCAA TCCAACTACT GCAACACCAC 390 0 

ACAGCATTTT CAATGTAAGC AACATATGAT TTGCCCCGCC ATTCATAAAT GACTGAATTA 39 6 0 

ATATCCAAAA TCCTGAAATT AACGTCAACA GCATAAATAA GCGTAAAATC ATGTGCAACG 4 02 0 

, s GTTTGAAAAA TGGTGATCTG CCTTGATTTT TTGAAATGTT TAAGTATGTA GCGATAAATA 408 0 

AAATAATCGC TAATACCCAA CTTAATATAT GTAAATGTAA CATACTGATT CCCCCCACTT 414 0 

TAATTATTTA TATTATTAAA TTAAAGCTTC TTGGGATTAA TACCCACTTG CTTGTAATTT 4 20 0 

20 AATCATGATT TGATTATACA CGAATATATA TTCTACCACA CTTCTATATT TGAGAGGAAG 4260 

AACATGACAT TTTATTCCTT ATTAGAATAT TGTGAATCTG CTGTAAAATA ATCAACTACT 4 32 0 

TCTTAATATC AATATTTCAC TTTCATCTCA AAATGGTAAC ATTATAAATA ATTTATCTTT 4380 

25 

AACACCTTTT TAGAAAAGCA AGAAAAAACT AACCAATCTA TATAAAGACT GGTTAGCTTT 4440 

TTAAATGATA ATTATTTAGC GATATAAGTT GTCAGCGTTC CAATATTATC AATAGTCACT 4500 

TTAACTTCAT CACCTGGTTG TAAAAATTTA GGTGGTTGCA TACCTGCACC AACGCCTGCT 4 560 

30 

GGTGTACCAG TTGCAATAAT ATCTCCCGGA TGTAGTG CAA CATATTTTGA AATTTCTTCT 4620 

ATTAATTCAT CAATTTTAAG AATCATTTCG CCAGTGTTAC CATCTTGTCT AATTTCATTG 46 8 0 

35 TTAACTTTTG TAACAATATT TACATTTTCA GGTAATGGTA GTTCGTCTTT AGTAACGATA 4 74 0 

TAAGGACCCA TTGGGCAACC GCCAGTTAAA CTTTTTGATA AAAATGCTTG ATCTTGTTCA 4 800 

CTTTGTGCTT TGCGATCAGT GATATCGTTA ATAATTGTAT AGCCGTAAAC ATAATCTAAA 4 860 

40 GCTAATGCTT TTGGAATCTT TTCACCAGAC TTACCAATAA CAATACCTAA TTCACCTTCA 4920 

TAATCTAATT GATCAGTAAT AT CTTTATG A TTTGGAATTG TTGCATTATC TCCTGTTAAA 4980 

GATGACGCTG CTTTTGTAAA TACATATAAT TTTTCCACTT CATGATTTAA TTCGTTCGCA 504 0 

45 TGATCTTTGT AATTTCTACC AAAAGCAATC ACATTATTCG GAGGTGTTAC TGGTGGTAAA 5100 

AATTCAATGT CATTAAATGA AATTTTATAG TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 516 0 

ACAACTGCTT TACGTACTTG TTCTTGAAAA TCTAAAGTAT GATTTTGTTG TAAACCAGCT 5 220 

50 

AACAATGTTT TAGGATGGAA ATCTCCTTCT G CAAAGT CAG CAAATACTTG TGTTAAATCC 5280 

CATACAGCAT CTTCGCGTTT TACTTTAACG CCATATGAAG TTTTGTCATT ATACTTGAAT 5 34 0 

55 
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TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 54 6 0 

TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 5520 

TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 55 80 

ATCGTTTGAA GCGTATTTTG ATAATATTTC AT CTGTAT CT TATATTTATT TTTTAATTGT 564 0 

GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 57 00 

ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 57 60 

ACG 5763 
(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

CCATATGATT TTGT GCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 6 0 

CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 12 0 

TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 180 

GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 24 0 

GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 3 00 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CAT CTCAG AA AGTGCAGTAG 36 0 

TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 42 0 

AT 422 
(2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS .- 

(A) LENGTH: 1188 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 6 0 

ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 12 0 
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TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC ATTTCTAGTA AACCACCTAT 24 0 

TTTACTAACA CCATTATTTC CAACTTATTT TTTCCCAACA CATTCTGACA GACAAAATGA 3 00 

5 AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA GAATTAAAAA ATCGTAAATG 360 

TAAAGTGACA TTTATTAATA ATGAATCAAT CATTCTTCAT GTTTCATACC ACAGTTTATG 42 0 

GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA GATAAACAAT CTCGCATGAT 4 80 

to 

ATCAAAAAAT CCCGACCAAC CAATAGATTA TAATAAAGCC ACATTGAATG TGTTTGAAGC 54 0 

ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT TATTTTTAAA ATATGCGGAA 600 

)5 TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT AAGGTATCTA TTGCAAATTA 660 

ACTTAATCTT GTTATAATAA TATTTGTG CT TGaTATTCAA ACACATACAA ATTAATCCAC 720 

AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG TCGTAGGTTC GAGTCCTACC 7 80 

20 TGTGGAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG CGCCCCTGCT AAGGtGTAGG 840 

TCGCGAAAgG CGCGAGGGTT CGAATCCCTC CGTTTCCGTT ACTTGCTAAA ATGGTATATA 900 

CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG AATTTCACTG TTCTTTTATA 96 0 

25 TCAATTTTAA AATTCTAAAA ACCTTTCCTA GATAATCTTC TCTAAGAAAG GTTTTTATAC 10 20 

TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA CCTGTTTTTA ACTATAAAAT 1080 

TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT CTCTGCaATT AATGtACTCA 114 0 

30 

TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA AATAATTC 1188 

(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

40 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 60 

45 ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 120 

TTGCGAACAA AAt CACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180 

GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 24 0 

SO 

CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 3 00 

TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 3 60 

55 
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GGCGGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 4 80 

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 54 0 

CTGGaAAGTG TGTAGGCGTC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 6 00 

GGcGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660 

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720 

TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 7 80 

GATTTTGAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 84 0 
(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 6 0 

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG AAAACACAAA TAATTTCATC 120 

TGAAATAACT GCATTATAAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180 

ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240 

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300 

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 36 0 

ATCTTAATAT AAAAATCGCA TTTATCACTT ACTAGGCGAC TCATACCCTG TCCTTCGCTA 420 

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480 

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATAGT TTTAGCTAAA 54 0 

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT 6 00 

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660 

GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720 

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AGCATATTCA 780 

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA B4 0 

GAT TTTGGT A CAGTTTGAAC AATGATTTTT TGATCTTTTG CATTTTTTAA AATTTCATTA 90 0 

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 96 0 
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TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 108 0 

TAAATAaAGA AAACCTATCA CTGCTTCTAT CkCTGAACTT TTACGATATG TTTGAACATC 114 0 

5 

AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 12 0 0 

TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 12 SO 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 13 20 

GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 13 73 
(2) INFORMATION FOR SEQ ID NO: 510: 

1S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY-, linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 



TCAAGTGCCA 


ACTACACAAC 


TGATTTACAT 


TCTTTAGGTC 


AATATGTACA 


AGAAGG CCGT 


60 


CGCTTCTTAT 


TCGAAACAGT 


TGTAAAAGTA 


AATCATC CT A 


AATATGATAT 


TACTATTGAA 


120 


AAAGATAGTG 


ATGATCTAGA 


CGGATTAAAT 


TATTTAGCTG 


GTAAAACAAT 


CGACGAAgTT 


180 


AACACAAAAG 


CATTCGAAGG 


TACATTATTA 


GCGCATACTG 


ATGGTGGTGT 


TCCTAACATG 


240 


GTAGTGAACA 


TTCCACAATT 


AGATGAAGAA 


ACTTTCGGTT 


ACGTCGTATA 


CTTCTTCGAA 


300 


CTTGCTTGTG 


CAATGAGTGG 


ATACCAATTA 


GGCGTAAATC 


CATTTAACCA 


ACCTGGTGTA 


360 


GAAGCATATA 


AACAAAACAT 


GTTCGCATTA 


t TAGGTAAAC 


CTGGTTTTGA 


AG a CTTGAAA 


420 


AAAGrATTAG 


rAGAACGTTT 


ATAAAATACA 


TTACTTCAAA 


GATTAGTGAA 


GTTTGAAAAG 


480 


ATAGAACTAG 


ACGTTAACTA 


TTTAAAGCAT 


ATTTTCGAGG 


TTGTCATTAC 


AAATGTAAAA 


540 


ATGTAATGAC 


AACCTCGTTT 


TTATTTATAT 


GCAAGAACTA 


GGTTACTAGC 


TAaTGTGACA 


600 


AGATGTTAAG 


AGAAAATTAA 


aGATAAAATA 


ATATCTGCCT 


TACAATAATA 


TTGTTATACT 


660 


ACTAGnGACT 


GATTTATTAG 


CATGATTACA 


TGTTAATGTT 


TCTTTACTTA 


GTAATTA 


717 



45 (2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 

(B) TYPE: nucleic acid 
so (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 6 0 

TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120 

GATAAAGAAG GATATATTGG CGCATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180 

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 24 0 

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 3 00 

TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 3 60 

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420 

AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 4 80 

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 540 

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 6 00 

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660 

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 7 20 

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 7 80 

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGT CTA 84 0 

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TC CAT AG AAT CGTACCTAAA 90 0 

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960 

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020 

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080 

CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAG TTAGGT 114 0 

TGGcTGAAAA TTAACTTTAT AGAATCATTG CCTCTAAGAG CGTATTCTTT AGAGTGG k TC 12 0 0 

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 1260 

CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320 

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 13 80 

ACAATACTTC GTATTGAATG G CTTCGCTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 14 4 0 

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 15 00 

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560 

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620 

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 16 8 0 

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 174 0 
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TACCGTAATG 


ATAGATATAG 


TTTTAAACGT 


GACTTCAAGC 


TATATGAATG 


1860 


TGATGACTGT 


T CAT CATGT T 


CTTTGAGACA 


TCAATGCATG 


AAGCCAAATT 


CGAAATCCAA 


1920 




ATGAAGAATT 


ATAATTGGGA 


ATACTTTAAA 


GyCCAAATTA ATCAAAAGCT 


1980 


TTCTGAACCA 


GAAACGAAAA 


AAATCTATAG 


TCAAAGAAAA 


ATTGATGTAG 


AG C CTGTTTT 


2040 


TGGATTTATG 


AAGGCTATTT 


TGGGTTTCAC 


TCGAATGTCA 


GTTCGAGGAA 


TAAATAAAGT 


2100 


TAAACGAGAG 


CTAGGTTTTG 


TATTAATGGC 


ACTTAATATA 


AGGAAAATAG 


CAGCTCAACG 


2160 


AG CTGTACAT 


TATAAAATAC 


ATATCAAAAA 


AGCTGATTTC 


TATCAAATAA 


TTAATAGAAA 


2220 


TCAGCTTTTT 


TACATTGCCT 


AAGAATTTAA 


TGTCCCAAGC 


CCTTTTTATC 


GAATAACTTA 


2280 


TTGTAAACCT 


TGTCTTTCTT 


GGTTATTGTT 


TT CGTTATTT 


TTTTCGTGTT 


TTTGTTTCCA 


2340 


CTCTTTTTGA 


GTCATTACAT 


CGTCAACTTG 


CATGTTAACT 


TCAACTACAT 


CTAAACCAGT 


2400 


AATATATTTA 


ACTTGTTCTT 


TAACTAATTC 


AGTTACTTTA 


CGGAAGATTT 


TTGGTGCTGA 


2460 


TTCACCATAT 


TCTAAGATTA 


CTTTTAAGTC 


TACAGCAGCT 


TGTTTTTCAC 


CAACTTCAAC 


2520 


AGATACACCT 


TGAGTAACAT 


TGTTGCCACT 


TGAGAATGCA 


TTAGTGAATG 


TATCAGTTAA 


2580 


GCCACCTTTC 


ATATCTAAGA 


TACCTTTAAC 


TTCACGTGCA 


GCGATACCAG 


CAATTTTTTC 


2640 


AACAACTTCA 


TCAGAGAATG 


TTAATTTGnT 


TTTGAATTGA 


GGCTCCTGAT 


TTnGTTCnTG 


2700 



(2) INFORMATION FOR SEQ ID NO: 512: 

30 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
35 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 



ATCAAATACA 


ATAATTAAAA 


TAaTGGCTAA 


TACaCCTAaA 


ATaATAATcG 


AGTTa GAGTA 


60 


GCCTAAGCGA yCACCcTAAC aGTAxACATT 


CTAGGCATAT 


ATTTaTCTTT 


AGACATTGAA 


120 


GCCGCTAACA 


TCGGAAAAGC 


TGTAAATC CA 


GTGTTGGCTG 


CTAATACCAA 


AATCATCACG 


180 


GTCGTTGCCT 


GTACAAAATA 


GAAGGCAG CA 


TTATCACCAA AT ATTTG CAT AGCTAATTGT 


240 


GATAAAACTG 


TCGTTTCCGT 


TTGTGGcAAA 


ATTC CATAAA 


CATATGCTAA 


ACCAACGATA 


300 


CCAACTAATA 


AAAACGCTAA 


AATTGAACCC 


ATAGCAATTA 


ACGTTTTCAC 


AGCATTTTTA 


360 


GCACTTGGCT 


CTCTAAAATT 


GGTTACCGCA 


TTTGAAATAG 


CTTCAACACC 


CGTTAACGAT 


420 


GAAGCCCCTG 


ATGAAAATGC 


TTTTAAT A S C 


AAGAATAATG 


TTACTCCAGG 


AACCGCAGTT 


460 



1439 



EP0 786 519 A2 



AATATTAATA TCACTAACCC TATAATGAAA AGATATACTG GATAGGATAA TACGGTGGCA 6 00 

GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATC 660 

AGTrCkTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 72 0 

ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 7 80 

CATTTTTCTC CTAAATTGGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 84 0 

TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 9 00 

ATCGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 96 0 

TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 102 0 

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCCCTAT AATAAGt CTT 108 0 

TTAAATTGAT kGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 113 5 
(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 91B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

TTATGGTAAA ACTTTAACAC AAGCATTACA TAAATCGCAA TCATTTATGT GGCAAAAACG 60 

TTGAATTTAA GCCATTTCTA TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 12 0 

CACATTTTAT TCAGATTTTT TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 18 0 

GATATTTTCA AATAATACTC TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 24 0 

TTTAGCTAAA TGATAAACTG ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 300 

AGATAATGCT TGATCCATTG ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360 

AGAATGTGTT GATGTGCAAG TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 4 20 

CATAGGATCG GCACCTAACT TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 48 0 

TAATGCAGCT TTTGCATTAT CACCGTAGCC AATTCCAGCT ACGATACCAC TTGCTACTGC 54 0 

GATGATATTT TTCAATGCAC CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 6 00 

ACGCAAATAA TCATTCATAA ATAAATCTTG CGTTAATTTA CTTA CACTTT TATCTTTTGA 660 

TGAAGCAGCA ACTGTAG TTG GTTG CTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 72 0 

CAACACGCCA ATACCTG CAT TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 7 80 
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900 
GTAAATATCT GCAAATTG 918 
(2) INFORMATION FOR SEQ ID NO: 514: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



JS (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 

CAATACTCGG TATTATAGAT AG T C CTACTA CATTTATATG GGTTTGTATC ATTACATTAA 6 0 

TTG CACAT C A ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 120 

20 TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GG CTTTA CAT 180 

TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 240 

AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 300 

25 GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 350 

TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 420 

TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 4 80 

30 

GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 54 0 

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 58 7 
(2) INFORMATION FOR SEQ ID NO: 515: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 60 

TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 120 

TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 

TATTGTGTGC TCTCG CTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 24 0 

TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 3 00 
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TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 420 

TTATGATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 4 80 

GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 54 0 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TT CTAATGTA TATGAAAGTG TATCAATATA 600 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 6 60 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 720 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 780 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812 
(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

. Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 6 0 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 120 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 24 0 

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGA CTTGT 300 

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360 

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 4 20 

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 4 80 

ATAATCGrGc TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS - double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
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GGAACACCTA AAGAAAGAAA TGATGCATTA AACACAGAGG CTGATATCTA TGTAACCAAT 12 0 

AAAGAAAATA CTAAATGGTT ATGCGATCAA TATAAAAAAG AATGGCCATT TGATATGGTT 180 
5 GTAATTGATG AACTGTCTAC ATTTAAAAGT CCTAAGAGTC AAAGGTTTAA AT CTATTAAA 24 0 

AAGAAATTAC CACTCATTAA TAGATTTATA GGATTAACAG GAACACCTAG CCCAAATAGT 3 00 

TTACAGGATT TATGGGCTCA AGTTTATTTG ATAGACAGAG GCGAAAGACT TGAGTCTTCA 3 60 

10 

TTCAGTCGTT ATCGAGAAAG GTACTTTAAA CCAACACATC AAGTTAGCGA ACATGTTTTT 4 20 

AACTGGGAGC TAAGAGACGG ATCTGAAGAA AAGATATATG AACGAATAGA AGATATATGT 4 80 

)5 TTAAGCATGA AAGCGAAAGA TTATCTAGAT ATGCCTGACA GAGTTGATAC TAAACAAACA 54 0 

GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA TATGCAGAAT TAGAAAAAAA CTATATTTTA 60 0 

GAATCGGAAG AAGAAGGAAC AGTTGTAGCT CAGAATGGGG CATCATTAAG TCAAAAACTA 66 0 

20 CTTCAACTAT CTAACGGTGC AGTTTATACA GATGATGAAG ATGTAAGACT TATACATGAT 7 20 

AAGAAGTTAG ATAAGTTAGA GGAAATTATA GAGGAGTCTC AAGGCCAACC AATATTATTG 780 
TTTTATAACT TCaAACATGA TAAAGAAAGA ATACTTCaAA GGTTTAAGGa AGCAACCACA 84 0 

25 TTaGAGGATT CAAACTATAA AGAACGTTGG AATAGTGGAG ACATTAAGCT GCTTATAGCA 900 

CATCCAGCAA GTGCAGGGCA TGGATTAAAC TTACAACAAG GTGGG CACAT TATTGTTTGG 96 0 

TTTGGACTTA CATGGTCATT GGAATTATAC CAACAAGCAA ATGCAAGATT ATATAGACAA 1020 

30 

GGACAAAATC ATACGACTAT TATTCATCAC ATTATGACCG ATAACACAAT AGATCAAAGA 1080 

GTATATAAAG CTTTACAAAA TAAAGAACTA ACGCAAGAAG AATTGATGAA AGCTATTAAA 114 0 

35 GCAAGAATAG CTAAGCATAA GTAATGGAGG TATAAGATGG GAAAGGCATC ATACGATATT 120 0 

AAGCCAGGTA CATTTAAATA TATTGAGTCA GAGATATATA ACCTACAAGA GAACAAGAAA 126 0 

GAGATAAATA GATTGAGAAT GGAGATACTT AACCCAACGA AAGAGCTAGA CACTAACATT 1320 

40 GTGTATGGAC CGTTGCAAAA AGGTGAACCA GTTAGAACAA CTGAAC TAAT GGCAACAAGG 138 0 

TTATTG AC TA ATAAGATGTT ACGAAACCTA GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 144 0 

TACTTAAAGT TACCTGAAGA TCATAAGAAA GTAATTAGGC TAAAGTATTG GAATAGAGAT 1500 

45 AAGAAGCTAA AGATAGAGCA AATAGGAGAT GCATGTCACA TGCATCGTAA TACAGTTACT 1560 

ACTATACGAA AGAACTTTGT TAAAGCGGTA cGwtATCATG CAGGTATCAA ATAACATTGT 1620 

G CAAAG ATTG TGCAAAAGGC CTACAAATCT GTAGTAATAT GATAGTATCG GAAAGATGTA 168 0 

50 

TAAAGTTATC TAAAAGTTAT ACGACACAAG TACACGAGGC ACATCGCTAT GCGtGTGTCT 174 0 

TTTGTTATGC AAT CAAAG AG GTGTAAGAGA TGACCAAGCA TAATAACATT TATAAGCATG 180 0 

55 
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AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACGTGAA GACATAGTAA 192 0 

CAG ATG CAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 198 0 

5 ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 204 0 

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 210 0 

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 216 0 

w 

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 222 0 

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 228 0 

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 234 0 

15 

ATGATAGAG C ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATC C ATTAAGCATA 24 0 0 

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTA CTCA AGTCTATGGG TTTAACTGCA 246 0 

20 GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGTTTTAAA 252 0 

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 258 0 

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 264 0 

25 AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700 

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760 

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAG AAG C 2820 

30 

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 288 0 

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TG AAATC CAT CTTTTAGCAA ACGTAATGAA 294 0 

AGAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3 0 00 

35 

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3 060 

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120 

4Q TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGG C 3180 

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 3 24 0 

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3 300 

45 TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3 360 

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420 

AAAGGC T AAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 34 80 

S ° TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3 54 0 

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3 600 
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AACACATTCT TGGATTCCTA AG CAT AAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 372 0 

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 37 8 0 

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3 84 0 

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3 90 0 

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3 96 0 

AGATGGGAAA ATAATATTTA ATAAT AATC C TTTAATGAAA TGGTATATCA ATAATGTTCA 4 02 0 

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4 08 0 

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 414 0 

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATG CGTT AAGGAGGTGA 4 200 

ATGTTATCGC AAAAGAGAAT ATTGTCACAC G CAT AAAG AA AAAATTGATA GACAATTGGA 4 26 0 

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 432 0 

GTGTAATCAA T AATACG CTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 43 8 0 

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TT AT AAAG T A GTTAATACAG 44 4 0 

AAGTAT CTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4 50 0 

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATG CATA TGTG 4 544 
(2) INFORMATION FOR SEQ ID NO: 518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 881 base pairs 

(B) TYPE.- nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 6 0 

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120 

TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTC CTTTG 24 0 

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 3 00 

AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360 

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 420 

AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 48 0 
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ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 600 

TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATT 660 

5 ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 7 20 

ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 78 0 

GCGAATAATA CmAAGAGACA TCAATCG GTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 84 0 

W 

TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 

(2) INFORMATION FOR SEQ ID NO: 519: 

(5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3122 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 

TGAATaaAAA TATATTAATA GATAAACACA AATGTGTCcA AATACCCCTA GAGG TATTTG 60 

25 ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 12 0 

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 18 0 

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 24 0 

30 

GTTCCATACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 3 00 

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAG CTAAAC 36 0 

^ AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 420 

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 480 

ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 54 0 

40 TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 60 0 

AAAGTTGGGT GAAACAAACA AGG CATACTT TAGTTAAG CT TGATGAAAAT AACAATGGAA 6 60 

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 72 0 

45 GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 78 0 

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGG GCTTA 84 0 

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900 

50 

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 96 0 

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 102 0 
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TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 114 0 

ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 12 0 0 

5 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 126 0 

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 13 2 0 

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 13 8 0 

W 

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 14 4 0 

TATATGTGTC TGGCGAAGGT GAAGATG CAT TAGGGTATAA AAATATGCCA TCAAAAACAC 150 0 

, s AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 1560 

CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACT CAC TGATTTAGGT GGTGGTTCAA 162 0 

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 168 0 

20 ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCG AAACAAATGT 174 0 

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800 

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 186 0 

25 AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 192 0 

CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 198 0 

GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 204 0 

30 

TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 2100 

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 216 0 

35 ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 22 20 

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAAC CCAG TCCGTTCATA 2280 

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 2340 

40 ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 24 00 

TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGT CA GTCAGGTGTT AGAAGTTCAA 24 60 

TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2 52 0 

45 ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAG CATT 2 580 

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 2 64 0 

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 270 0 

ATCAAC CATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCT CGTAAAG 2760 

ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 2 820 

55 
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TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA GAAAATATAG 2 94 0 

GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 3 000 

TTTCTAATTT TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT ATAAAGTGTG 3 060 

GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCC 3120 



3122 



(2) INFORMATION FOR SEQ ID NO: 52 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

ATAnAGATAT AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT TAA CAATTCT 60 

ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGC CTT TTCTGTTCTT 12 0 

AAAGACACTC TTTATGGGTG TGGTATTTAT CTTTATAGGA TATATTTCAC TGGCAATATC 18 0 

TGCCGTAATT ATT T ATGGTA TTATTCAATT TATCACAATT GATTTTGGTA GTTTTTTCTT 24 0 

AATGGGTATT ATATTGATCT TGATTTCAAG TATATTCCAA TTATTTATAG TTAGATTACT 300 

TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGT CGTT TTAs As CATT TAATTCAATG 360 

GTTCTTAGTT TAC TTTGCG A TCTATCAAGC AGTAAATGAA AAAATGGACA TTAATGATAT 4 20 

TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT TAGTAATTCT 4 80 

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATGAGAA GTTACAAATA 54 0 

AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT TTGTTGTGTG 60 0 

GTTTTATTTT TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 66 0 

TGATGAGTCA CAAGATAGAT AG CTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 72 0 

TAATAATTAG TTAATTTCAA AAGTTGTATA AATAG GATAA CTTAATAAAT GTAAGATAAT 78 0 

AATTTGGAGG ATAATTAACA TGAAAAATAA ATTG AT AG CA AAATCTTTAT TAACATTAGC 84 0 

GGCAATAGGT ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG AAGGATACGG 900 

TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 960 

TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 1020 

TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 108 0 
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TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 12 00 

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 126 0 

5 AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 13 20 

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 13 80 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 1440 

10 

AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 150 0 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAG CAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 16 2 0 

15 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 16 8 0 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 174 0 

2Q TGATG CTAAA GTTG CATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 18 00 

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAAC CTG TTTTCAATTT GCATATGTGA 186 0 

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGA GACAAA 1920 

25 TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980 

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 204 0 

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

30 AAGTTTTTTC AATACT CGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 216 0 

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 22 2 0 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 228 0 

35 

AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 234 0 

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 24 0 0 

^ CGGGGCCCCA ACATA GAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 24 6 0 

GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 252 0 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAG CTA ATGATAAAGA ACCACTACAT 2580 

45 AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 264 0 

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 270 0 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 276 0 

50 ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2 82 0 

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2880 
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AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 30 0 0 

AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTG cACA GTATAGAATA 306 0 

ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 312 0 

ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AGTCCATACA 318 0 

AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATGCAT 3 24 0 

GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 3300 

TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 336 0 

TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 34 2 0 

TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 34 8 0 

AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 3 54 0 

TAAATAATAG ATTGC GATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 360 0 

GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 366 0 

CATTTGTTAA CCTCCTTGAA CATTTAAATT GATT CAACTT AAGTTTAACT TATTCATACA 3720 

ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 378 0 

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 3 84 0 

TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAAC TTTT 3 900 

ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 3 96 0 

GTCATGTTAG CCAATTTTTA AG 3 982 
C2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1353 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 

AGCTTGGaTG ATTTAATa TG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 60 

TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 120 

GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 

TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 24 0 

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 300 
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GTTAGGTAAA ACATTTTATC AAAACTATAG AGATGATGTT GAAAGTTTAT ATAGTAAGTT 420 

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 4 80 

GATGTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 54 0 

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 6 00 

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA €60 

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720 

AGTTTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 78 0 

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 84 0 

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC CAACAAAGAA 900 

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960 

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 102 0 

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 10 80 

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 114 0 

TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 12 00 

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 12 6 0 

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACGTTT 13 20 

TAATAAACAA ATCAATAAAG TATATAATGG GCA 13 53 
(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60 

ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120 

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180 

AGCTCAAAAA ACATTAGCTG AAGAAG TAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 24 0 

TGATG CAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300 

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 360 
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TGTTAACAAT 


GGTGCGATTT 


ATATTAATGG 


TGAGAGACAA 


CAAGATGTTA 


ATTATG CTTT 


480 




AGCACCAGAA 


GATAAAATTG 


ATGGCGAATT 


TACGATTATT 


CGTCGCGGTA 


AGAAAAAATA 


540 


s 


CTTCATGGTT 


AACTATCAAT 


AAATATAATT 


GCATAGCTAA 


ATAAATTAGA 


GCCTACTCAT 


600 




ATTCATTCCT 


AAGAATGTAA 


TGAGTAGGCT 


CTTAATGTAC 


TTTTCTGTCT 


GTAAATTATC 


660 




TAAAGAAATC 


TCTATCGCCT 


TGGCCAGGTG 


ATTGACGTTC 


TGATCGGCTT 


TGA CGTTTTG 


720 


w 


GTTGTTCTTT 


TTGTTGTTTT 


AATTTCACTT 


TAACTTCTTT 


TGTTTTACCA 


TCACGGATAA 


730 




CGGTAACAGT 


GACTGATTCA 


CCAGGTTTTT 


TATTTTCATA 


TAAATAGCTT 


CTTAAATCAA 


840 


15 


CATCATCTTT 


AATTTTCTTG 


CCATCAATTT 


CTGTAATAAT 


ATCACCTTTT 


TTAAGATCAA 


900 




TATCACTATC 


AGCTTTGGCG 


ACATAAATAC 


CGTCTTCTCT 


ATCAGTATGA 


AGTTGCTCGC 


960 




GCTCTTCTTC 


AGGAATATCT 


TTCAAATTAA 


TTAAACCAAT 


ACCAATCGAA 


GGGCGGTCAA 


1020 




TTTTACCATG 


TTTTACAAGT 


TGTTCAATTG 


TTACTTTAAC 


TTCATTACTT 


GGAATAGCAA 


1080 




ACCCGATACC 


TTCAACTTGT 


GTCGCAGCAA 


TTTTCATTGA 


GTTAATACCA 


ACTAAATTAC 


1140 




CATTAATATC 


TACTAATGCG 


CCACCTGAGT 


TACCTGGGTT 


AATAGCAGCA 


TCTGTTTGAA 


1200 


25 


GAACG CTAAC 


TTTTGTATTG 


CCACCAGTTG 


TCTCAGCGTC 


AATCGTACGT 


TCGCTTGCTG 


1260 




AAATGATACC 


AGATGTTACA 


GAGTTAGCAA 


ATTGTAATCC 


TAATGGGTTA 


CCCATTGCGA 


1320 




ATACGCTATC 


GCCAGTTTGT 


ACTTTTGAAG 


AGTTGGCAAA 


TTGAATCGCT 


TTAATACCTT 


1380 


30 


TTGTATTTTC 


AATTTTAAGT 


ACAGCAATAT 


CAGTTACTGc 


ATCTTTACCA 


ACTAATTTCG 


1440 




CTTTAACTTG 


TTTTTTATTA 


TGTAATTGGa 


CTCTAATTTC 


ATTTGCGCCA 


TCAATAACAT 


1500 




GATTGTTTGT 


AACGATATAA 


GCTGAATTGT 


TGTTTATTTG 


aTAGATAACA 


CCTGAACCTA 


1560 


35 


CTCCAGCTTC 


AGATGGTTTA 


GATGATTTGC 


CyTTTAATAA GTCGTCTACA CTTGATGCTT 


1620 




TTtGCATGTk. 


AATAACTCCA 


ACAATTGTAG 


GGGAGACAGA 


TTTTATCATT 


TCATGAACGG 


1680 




TAC CGAATTT 


CTTGCTTTGA 


CCGTCTAATT 


GATTGCCACC 


TTTATTATTT 


GTTGTCTGAA 


1740 


40 


CAGTTGAACC 


ATCTTTATTT 


AAAATTGTAC 


TATTTAATAC 


TTTGCCTATA 


CCAAGTACTA 


1800 




GAAGTGCACC 


AATAATTCCA 


GCAATCAATG 


CAACGATGAC 


TGTnTTAAAC 


CATGGAAATT 




45 


TAGGTCTTCT 


GTATCTAGGT 


GTTTGGCTAT 


GGTTTGTTGT 


AGAATGATCT 


GTATGATTAA 


1920 




AATCTGACAT 


ACTTAACCTC 


CATTATATGA 


TTTATATATG 


CTTTAATTAT 


GTCTTTTnTT 


1980 




TATAATT 












1987 


50 


(2) INFORMATION FOR SEQ ID NO: 523: 









ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 

55 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 523: 

TTAGAAGTCA AATCATTAnT GGCGTCnTAT CGAGTATACT ATTAACTTCA ACTATTTTAG 60 

CAATTGCATA TATTTTAATG TGGTTTAACG GCCATATGAC aCTAACTTTG ACCTTAACGA 12 0 

CAATAATTAC AAGCTGTTTA ACCTTATTAA TATGTAGTAT TTTTATTAAT CCACTTATAC 180 

AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 24 0 

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 300 

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 360 

AACTGATTCA AAACTTAGCC CATGATTTAA AAACACCTTT AGCAAGCATT ATTTCATATT 42 0 

CTGAAGGACT ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 4 80 

TATTAATTAA ACAAGCAAAC AGATTATCAA CATTATTTGA TGATATGACT CATATTATCA 54 0 

CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATACA ACTAGACCAA TTACTTGTAT 600 

CAATATTGCA ACCATATGAG CAACGTATCA AACATGAAAA CCGCACATTA GAAGTGAATT 660 

TCTGTAACGA AATTGATGCA TTTTAT CAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 72 0 

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGGTAG TCGCATTGAT ATTAATATTA 78 0 

GTGAAAAC G A AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 84 0 

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 9 00 

CAAAAACGGG TGGTTCTGGA TTAGG CTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 96 0 

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020 

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 108 0 

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 114 0 

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTG 120 0 

CCTTAACTTC ATAATATATC CTGCCAATAT ATTCACCTAC AACACCAATT GAAATTAACT 126 0 

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTGAAAA ATATCCCGGA ATATTAACAC 132 0 

CAGATATCAT AATATTGATG GAATAAATAG A 13 51 
(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

TTGTTGT CAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 6 0 

TATTCCATTT TTAGCCCcTG C CATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 12 0 

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 180 

AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 24 0 

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTGTGGCAAA AGGTTTATAA 3 00 

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 36 0 

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 420 

TTTTGACCCC nAT 433 
(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

CCCCCACTAT GATATGCTTG GCCTATTGCG TCAATCCCCT TATATTTCGG CAATAGAGAT 60 

GGATGTATAT TCAATATTTT ACCTTCAAAT GAAGCTAATA AGTCTGGACC AATTAGA CGC 12 0 

ATGTAGC CAG CTAGAATAAT CCACTCTACC TTATCTTCAT TTAATAATGT TACTAAATGT 180 

TGTTCATACG CTGCTTTTGA ATCAAATTGT TTTGGTTCAT TAATATAAAC AGGAATATCG 24 0 

TGCTTTTTTG CTCTATCTAT ACAAAACGCA TTTTGATGAT CCGTATATAG CGCCGTAACT 300 

TCAATATTTT CAAGTTTTCC TGATTCAACA TGCTCAACTA TATTTkCAAA GTTACTTCCT 360 

GAACCTGATG CAAAAATCGC AATTTTAACC ATTGTTATAC CCCCAACAAT TCAATTGCAG 420 

TTGACTCATT TTTCACAATA TGACCAATTT GATAGGCTTC CACATTTTGT TCTGCTAAAA 4 80 

TCTTCAAAGC GCGTGATACA TCTTTTTCAT CAACGATAAC CGTATAGCCA ATACCCATAT 540 

TAAAAATGTT ATACATTTCA TTTGTGTCTA TATTGCCTTG TTGTTGTAAC CAATCAAATA 600 

TTTTTGGTGT TGGAAATGAT GTAGTATCAA TTCTAGCAGC ATATCCGGCT GGCAATGCAC 660 

GTGGAATATT TTCATAAAAA CCTCCACCAG TAATATGATT CATTGCCTTA ATAGAAACTT 720 

CTTTTTTCAA AGCAAGTACA GGTTTGACAT ATAATTTAGT TGGTTCTAAA AAGACATCGA 780 
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GCACTAAACT 


GTATCCATTT 


GAATGAATGC 


CACTTGACGC AAGCCCTATA ACAACTTGTC 






CCTCTTTCAC 


TTCTGAACCA 


TCTACATAGT 


CATCCTTTTC AACTGCTCCA ACAGCAAATC 




s 


CAGCTACATC 


ATATTCGCCT 


TCGTGATACA 


TTTCACCCAT TTCAGCAGTC TCTCCACCGA 






TAAGTGCAGT 


ATTCGTTTCA 


ACACATGCAT 


CACTAATACC TTTAACAATT TGTTCAATAA 




10 


CTTCAGGAAC 


AACTTTGTTT 


GTAGCAATGT 


AATCTAAAAA ATATAATGGT TCTGCACCTG 




TCGTTAAAAT 


ATCATTAACA 


CACATTGCGA 


CTGCATCGAT ACCTATCGAA TCATGTTTAC 






CATAGTCGAT 


AGCTAGTTTT 


AATTTCGTAC 


CTACTCCGTC TGTTC CAGAA ACTAAAACTG 




15 


GCGCTGtCAT 


ATTTAATTGT 


GATAAATCAA 


ATGTAGCACC GAATCCACCT AAACCACCGA 




TAACTTCTTT 


ACGCATCGTA 


CGtTTAACaw 


GaCtAGaCaT TCtTTCtACA GCTTCATAAC 






CAGCATGAAT 


ATTTACACCA 


GATTGTTCAT 


ATGCTTTAGA CATTTAAATT TCCCTCGCTA 




20 


TCAAAAAAGT 


GTTTGTTTTT 


AGAAATATAT 


TGCTTTTGTC GATGACTTAA ATGCGCTTTA 






TAATTTGCTT 


CATAATCATA 


TAACCCTGCA 


GGATAATCTC CAGTGAAACT TTCTACACAT 






AAGCCACTAT 


ATGGCGCGTC 


ATAATCTAAA 


CCAATTGATT CAATTAACCC ATCTACAGAT 


1620 


25 


AGATATGCTA 


ATGAATCAGC 


GCCAATATAA 


TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 


1680 




GAAATTAATT 


CTGCCGTAGT 


TGAAACATCG 


ATTCCGTAAA AACTTGGAAA CATAAATTCC 


1740 




GGTGATGCTA 


TACGCACATG 


CACTTkATTT 


GCACCAGAAT CkTTTAACAT TTTCACAATG 


1800 


30 


CGTCGAATTG 


TCGTACCGCG 


nACAATGGAA 


TCATCAACAA GACTG 


1845 



(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 

TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 60 

TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 120 

AGGTTGTTGT TGATTGCTGG AmtCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 180 

CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 24 0 

TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 300 

ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC 360 
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AGTTTTAATT TCAACATCAT TTGTTaAGTA CATTTTTGCT AAAGCTTTAA TTTCAGAGTT 4 80 

AGTTAAATTA TGCTTTGCAT TTTTACCTAC AATTTGAATC ACGTTATCAA GTTTATCAAT 54 0 

AGAATCAACT T CCTGTG CTT TTTGGAATAA AATCTTAATT AATTCCATTT GACGTTGTCC 6 00 

ACGTTTTAAG TCTGAATCAT GATGTCTAGT TCTAGCAACT GCTAAAGCCT CATCACCATT 66 0 

TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 72 0 

TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTACAGCCT CGACAAATGC 7B0 

TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATGGTAGCTT CTACCGAATC B4 0 

CATTGCGGCA ATTGGACCAC CATATGCATG TGCATGGGTA ATCTTATCGT AATAGCCAAC 900 

TTTAGGAATG TAGCTGATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTTTTGA 96 0 

TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT C CTGTCCTTT 1020 

TTTTCTTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 1080 

GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 1140 



1141 



(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAn CAGTTG TTGAAAATTC GGCGCCAACA 60 

AGTAAACmGG CroCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 120 

AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATGTGAAA 180 

TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 24 0 

ACTTTTAAAG GTGTTTCTG C AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 3 00 

AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGGCA AGGACTTGAA AAAAGAAAAT 360 

GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 4 20 

GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTC GAAG TAACAGTTGG ACATACACCA 4 80 

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGC TAG AGTGTGAGGA GTGATACCGC 54 0 

ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 600 
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GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 72 0 

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAA CAT 780 

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 84 0 

GAAATAATCA TTTAC TTTT C TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900 

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGG CA 96 0 

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020 

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080 

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 114 0 

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200 

CATGCACCTG a CAAAG AAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260 

AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAG TAAAGAT 1320 

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAATAAAGC GAAAAATCTT 13 80 

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 14 4 0 

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500 

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560 

AAAGG 1565 
(2) INFORMATION FOR SEQ ID NO: 52 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 12 0 

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180 

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 24 0 

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 30 0 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 360 

TTGATTAGTG CAGGTATArC TGGrATTCyA GTCTCTACAA CGCCTATTTT TTCTAGTGCT 42 0 
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GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 54 0 

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600 

s 

AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATa TACAAT AATGGCAAGC 6 60 

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 72 0 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780 

10 

CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 84 0 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

, 5 ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATACA 960 

TAATTATTCC AAGTCCCGCA CCTCAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 102 0 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 108 0 

20 CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 114 0 

ATTACA GTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 12 0 0 

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 12 6 0 

25 

TAAGGGATGG TCAAATGGAA GGTTCGAAGA TTCATATTTT AGAAG AG TTA CCTAAAGCAG 13 2 0 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 138 0 

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 144 0 

TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 150 0 

CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 156 0 

35 TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 16 2 0 

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 16 8 0 

ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 174 0 

40 TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA 1800 

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 186 0 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACG ACAAGTCAAA 192 0 

45 

AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 204 0 

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 

50 

TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160 

AAAGTTG GTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220 

55 
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TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 234 0 

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 24 00 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 24 60 

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 252 0 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 2 64 0 

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 27 60 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 282 0 

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 287 0 
<2) INFORMATION FOR SEQ ID NO: 52 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529: 

CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 24 0 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300 

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 36 0 

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 42 0 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 48 0 

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 54 0 

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTG CAAAAC 60 0 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 66 0 

TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 72 0 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 78 0 
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TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 90 0 

AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 96 0 

5 TTCTGGTACT TGTTCGGCTA AAAATAAAGT GTCATTGCCG TTACCGCAAG TTGCGTCTAC 102 0 

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 1080 

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGCAAGC CTTGATAAGA ATTTCTACGT 114 0 

10 GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 12 00 

ACCATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 1260 

ACTTCTAATT GGTC CACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTTAATAAA 132 0 

15 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 13 80 

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGC CATCATCATG TCATAGTCTT 144 0 

20 CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500 

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACGATT GATTAAATCA GATGTTGACT 1560 

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 1620 

25 AATATTCGAC AACATC GTCT GGTAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680 

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTG 174 0 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 18 0 0 

30 TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 18 6 0 

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 192 0 

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 198 0 

35 

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2 04 0 

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC ATATTTTAAT 210 0 

40 GTTATAGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 216 0 

CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 222 0 

TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 228 0 

45 GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 234 0 

TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 24 0 0 

GTACAACG CT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 24 6 0 

50 TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 2520 

CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 25 80 

ss 
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ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 27 00 

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 2 76 0 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 28 2 0 

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2 88 0 

TTTTATGGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2 94 0 

TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3 000 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA ACTTTA CAAT ATTTGyTGTC GGTATGATTA 3 0 60 

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105 
(2) INFORMATION FOR SEQ ID NO : 53 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5532 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 

TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60 

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTAC CAATG GAACTTGCTG 120 

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA 180 

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 24 0 

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG AT CAATT AG A GGTGATTTAG 3 00 

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 3 60 

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 42 0 

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 4 80 

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 54 0 

GTAAATTACT TGAAAAATTT TATAGTTTTT kGGTAAcACG TAtTaAAAaG AGAGGAATAT 6 00 

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660 

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 720 

TTAGATAAAC CACTTAAT CC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 78 0 

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 84 0 

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900 
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GATACGATTA TATTAAAACG GCTAATCATT 
TAGAAATT CA TGATACTTTT CTACAGACGA 
5 TATTAAGATA ATTCTGACAT TTAAAATGAG 
AACAAACATT TATGAATGCA CAATGAATAT 
CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT 

10 

ACCTCAATTA ACAGTTATTG TTGAAGGTGT 
TATTAATAAA GAAATGTTTA AGCGTCAAGG 
TGAGAAAGAT ACAGTAGAAA TAGTATCAGG 

15 

AATTACTATG GTTGTAACCA ATGATGACTT 
TCCAATAAGT GAAGAAGAAC GTGAAAATAT 

20 TCATGCAGAT TTGGTTGGAG GTATGAAATA 
GCGATCATCT GCTAGAGAAA CAGCAGCTCG 
ATTACAACAG TTAGATATCG ATATATACAG 

25 TAAAGATTTT TATGATTCAG AAACATTTAA 
AATTGATGAC AGTATCGCAC AAGCAATGCG 
AGATTCAATT GGCGGTGTCG TTCAAGTTGT 

30 

TTATGTG CAT TATGATCGTA AGTTAGATGG 
T GCTTTT AAA GGTGTAAGCT TTGGTGAAGG 
GATTCAAGAT GAAATTCTAT ATAATAGTGA 

35 

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG 
ACCAATTCCA ACGTTATATA AACCATTAAA 

40 TAAAGCAACA ATTGAACGTT CTGATAGTTG 
ACATGTCGTA GCATTTGAAA TAGCAAAAGC 
TGAGCAACTT AAACAACAAA TTATTG AG C G 

45 AGAACAATTG AGGTGTAATC ATGAAATTAC 
TATATGTTGA ACACGGTGCA ATTGACCATA 
GTTTTATATT AATTGACGAG CATGTAAATC 

50 

T ATCATAT G A AAATGTACAT AAAGTTATTA 
AGCAATATCA AGAAACATTA GAGTATATTT 

55 



TTTAATTAAT GATTATATGA TGCAACTGTT 10 20 

ATATATTATA ATTAATTTTA GTTCGTTTAA 10 80 

ATGTCATCCA TTTTCTTAAT TGAGCTTGAA 114 0 

GATAAGATTA ACAACATATT ATAATGTTAT 12 00 

GAGATACCTA ACATCAGGAG AATCACATGG 126 0 

ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 13 20 

CGGTTACGGA CGTGGACGTC GTATGCAAAT 13 80 

CGTTAGAAAT GGTTATACAT TAGGTAGTCC 14 4 0 

TACGCATTGG AGAAAAATTA TGGGAGCAGC 1500 

GAAACGTACT ATTACAAAAC CAAGACCTGG 1560 

TAATCATCGT GATTTAC GAA ATGTGCTAGA 1620 

AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 1680 

TCGTGTTGTT GAAATAGGTG GAATTAAAGA 174 0 

AGCAAATCTT GATCGTAATG ATGTTCGTGT 180 0 

AGATAAAATT GACGAaG CTA AAAATGAAGG 18 60 

AGTTGAAAAT ATGCCTGTTG GTGTAGGTAG 192 0 

TAAGATTGCA CAAGGTGTTG TCAGCATAAA 198 0 

ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 2 04 0 

AATTGGTTAT TATCGTGGAT CTAATCACTT 2100 

AATGCCAATT ATCGTTAATG GTGTAATGAA 2160 

TTCAGTAGAC ATTAATACTA AAGAAGACTT 222 0 

TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 22 80 

ATTATTGGAA GAATTCCAAT CAAATCATAT 23 4 0 

CAGACAATTA AATATTGAGT TTTAACAACA 24 00 

AAACAACATA CCCTTCAAAT AATTATC CAA 24 6 0 

TTAGCACGTA TATTGATCAG TTTGAT CAAA 252 0 

AATATTTTGC TGATAAATTT GATGATATTT 2580 

TTCCAGCTGG TGAAAAG ACG AAAACATTTG 2 64 0 

TATCCCATCA TGTAACTCGT AATACAGCAA 2700 
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ACGAGGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATGATT CTAGTGTTGG 282 0 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2 880 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 2 94 0 

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3 00 0 

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3 060 

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AA CAAGGTGT 312 0 

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT AT CATAAAAT 318 0 

ACCTCATGGT CATG CAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 3 24 0 

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3300 

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAAT AT AT G C TAAGTGATAA 3360 

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 34 20 

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 34 80 

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3 54 0 

AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600 

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3660 

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 372 0 

CC CAGGAT AT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 378 0 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 384 0 

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900 

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3960 

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020 

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4 080 

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 414 0 

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 420 0 

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4260 

T CAT AATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 43 2 0 

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4380 

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 444 0 

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4 500 
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4620 

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4 680 

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4 74 0 

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4800 

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4 86 0 

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4 920 

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4 9 80 

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GG TTTAATGT CTGAAAATCA 504 0 

AACTGACGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100 

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160 

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220 

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 52 80 

ATATCAAGTT AATGGTG TAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 534 0 

kAACTATAGT GATkCcGATt. CgCTTATACG ATGrAATTAA TGAAGATGAA ATGACTTCAG 54 00 

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460 

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520 

aATCaTTATA TG 5532 
(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 942 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60 

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120 

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180 

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 24 0 

TTTGTTTC CT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300 

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 3 60 
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 4 80 

TAAAGAAGAC GAAACGCCAT CAAATG CTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 54 0 

GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 600 

TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 6G0 

CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 72 0 

GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 780 

CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 84 0 

TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 90 0 

ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC 942 
(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS .- 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 

TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 60 

GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 120 

TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 180 

TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 240 

GTATGGATCA TGCTGGTATT GCGACACAGG CAAAGGTAGA AGCTAAATTA AATGAACAAG 300 

GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 3 60 

AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 417 
(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 60 
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ATTTTCATAC CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180 

AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 240 

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300 

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 3 60 

CGATGCGAAG TTAACAATTG TGAATGTAAT TGATT CAAGA ACGTATTCTT CTTATGAAGT 4 20 

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 4 80 

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 540 

TCyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATT ATGAG 600 

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTG TAT CAGAATCTAT 660 

CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 720 

CCAACCACAA GTT 73 3 
(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6060 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 
TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60 

CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 12 0 

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180 

TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 24 0 

TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 300 

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 360 

AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 42 0 

TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 48 0 

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 54 0 

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 600 

CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 66 0 

AAG AG GAG AA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 72 0 
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TAGAGCTTGA TTTAAATAAG TACCAACTAT 
ACTTGGTAGG TACTGAATCA AAGAAACGTG 

5 TAAATAGTAC TAGCGTTTAT TCAGTAATTG 
AAATCCACAA AGACTTTGTT GACTTAGAGC 
ACTACCTAAG TGCCTTACCC TACCAACTTA 

W ATATCGTGCT CTCCATTTCA CGTATAAAAA 
TTTATGATTC TGTACAAAAC ACATTTGAAC 
TTGGTCAAAT ATATGACGTC ACGTTTAATC 

IS 

TACGTGGAGC TAAACGAAAA AATCTTAATG 
ACAAAATTAA AGCGTTTGTT AACAAAGTAG 

20 TGGATACTTT AGTAGATGGA CTGACGCTAC 
CTAATATCGA GACCTATAAT CCGTTAACAG 
TTGAAAATGT AAGATTAGCT TTAAATGATT 

25 AAATTGCTTT TATAGTtTTA CACTTTGGTG 
TAAACATATT AGTCGTTTGC AGCAGTGGTA 
TAGAGCAAGT TTTTAGTGAG ATTGAG CGTA 

30 CACTAGATTT AAGTCAATAT GATGGCATTA 
CCTATTTAAC GGTAAACCCA TTATTACCAG 
TAAATACAAA GTCTACGTTC CAAGAGACGC 

35 

ATGATGTTCA TGTTGAAACG AAAGATGTTG 
CTTCATACTT AACTTCAGTT TTCGAAAAAC 

4Q TGCATCATAT G CGTTCGGGT TTAACTTTGC 
TTAAACAGTG G CAAACAT AT AT CGCAGATT 
CAACGTCATT CGCTGAACTA CTAGAGCAAC 

45 GTCCATATCC TGTTG CAATA CCACACCTAA 
TAATCACAGT TTTAGAAGAA CCGTTAACAT 
TTAAATATAT GATTTCCATG TTTATTTCTG 

50 

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG 
AGGAACTTGA AACATTATTA AGAAACAAAT 

55 
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CTTTATCTCG AAAGCGTGGC GAAGGCATTT 84 0 

AATTTTTAAG TCAATTAATG GTGAATAACT 900 

AAAATCATTT TGTCTTTCAT TCATTAAATC 96 0 

GCATTTTTAA TGTTG AAAGA CtATTaATGG 1020 

CCGAATCAAG TTATTTAACT TTAACTGTCC 10 80 

ATGGAGAG t A TGTCGCATTA AACGATGATA 114 0 

ACAAAGTaGc AAGCGAACTT GCTGATAAAC 12 00 

AGGCAGAAAT TGCTTTCATT ACTATCCATT 12 6 0 

ATACATCATT AAATAATCGT TGTGAAGAAA 1320 

AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

ACCTTAATCC TGCAATCAAT CGTTTGCAAG 14 4 0 

ACATGATTAA GTTCAAATAT C CAAG ACT AT 1500 

GTTGG CCTG A TTTGATTTTT CCAGAGAGTG 1560 

GCTCGATTAA AAACCAAGGT AATCGATTTT 162 0 

TGGGAACTAG TCGTCTATTA TCAACTCGTC 16 80 

TTACACAAGC ATCAGTCAGC GATTTGAAGT 174 0 

TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800 

ATAGTGATAT CAGTTATGTC GCACAGTTTT 18 60 

ATGATAAATC ATCAAACATG ATTGATAAGG 1920 

ATGGCAACAC ATCTTTTGAA AATGAACAAA 19 80 

ATTTAAGTGA CGAAAAATCA GAACAATTAT 2 04 0 

TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

ATCTATATCA ATGCGATGTA ATAAACGATC 216 0 

GATTGATTGA CAATCCAGGA TGGATATTAA 2220 

GAGACAATAT GATTAAACAC CCTATGATTC 22 80 

TGCCTAGTAT TCAAAATGAC AATCAAACAA 234 0 

ACAATGATTT TATGGCATCA CTGGTAAGTG 24 00 

AATCTATTGA TACTTTTATG GAAAATCCAC 24 6 0 

TTTTAGAACG AATTAAAAAA CAATTTATTT 252 0 



1467 



EP 0 786 519 A2 



TAACAGCCAA AATGAAGCAA TTGAAAAAGC 
AACAGATGCT TATATTCAAG CAATGAAAGA 
S AAATGGCTTA GCAATT C CTC ATGGCACAGA 
TTTAACATTA TTACAAATCC CTGAAGGCGT 
CGTGGGAATT GCTGGTAAAG ATGGCGAACA 
ATTTAGCGAA GAAGAAAATG TGGATCGTAT 
ACAAGTATTC GAGGAGGCAG ATGCATAATG 

1S GGTCGTGGTT TCATTGGTTA TATTCTgCAG 
TCAATGAAGA AATCATTAAT G CTTTAG CTC 
ATGAGTCTAA AACAACGACG CGCGTGAAtA 

20 CTGAAGCGTT GAAACAAGCA ATTCTAGAAG 
ACATACTACC TATTATTGCT AAATCTTTTG 
TTAATATTGT TGCTTGTGAG AATGCTATTA 

25 TTGATATTAC TGGCCCTCTT GGTAACnaTA 
TTGTACCATT ACAAAAGAAT GAAAATATAT 
GGGTTGTTGA AAAAGATGCA TGGTATGGTC 

30 

ATTTAACACC ATATATTGAG CGTAAATTAT 
CGTATgCTGG tAAATTTGCA GGTAAAGCTA 

3s TTGAAGCTGG CTTACGCCGT GTTTTAGCTG 
ATTTTACTGA AGCGGAACAA GCTGGTTATG 
CTTATTTATC TGATGAAGTA ACACGTGTCG 

40 AAGATAGAAT TATAAAACCA TTAACATATC 
TATTAAATAC AGCTGCATTG TTATTGAAGT 
AGAAAAATAA TTACATTAAA GAACACGGTT 

45 TTGACGATGG CTTAGCCGAT GAAATAATTG 
AGCTTTGTTT GAAACAAGAA GTTTCCAACG 
AGTTCCCTTG AATTAACAAT ATTAAATTTC 
ACACTTAACT TGTGTTATGT ATGAAAGGAG 
CGAGCCACAG CCACCTATTC AATGGCTATT 

55 



AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 2 640 

TCGTGAGCAA GTCGTATCAA CATTTATGGG 27 00 

TGAAGCTAAA ACAAATGTGA TTCACTCAGG 2 760 

TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 2 8 20 

TTTAGACTTG TTATCTAAAA TTGCAATTAC 2 880 

CGTTCAAGCA AAATCTGCAG AAGAAATTAA 2 94 0 

AAAGCAGTTC ACTTTGG TGC TGGTAACATA 3 000 

ACAACAATGT TAAAGTAACA TTTG CAGACG 3 06 0 

ATGATCATCA ATACGATGTT ATTTTAGCTG 312 0 

ATGtTGATGC AATTAATTCA ATGCAACCTT 3180 

CTGATATTAT TACAACAGCT GTTGGTGTTA 3240 

CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3 3 00 

TGGCAACTGA TACATTGAAA AAAGCAGTAC 3 3 60 

TTCATTTTGC TAACTCAGCA GTTGATAGAA 34 20 

TAGACGTTAT GGTTGAGCCA TTTTACGAAT 3 480 

CAGAACTAAA CCATATTAAA TATGTTGATG 3540 

TAACTGTGAA TACAGGACAT G CATATTT AG 3600 

CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 3 660 

AAACTAGTCA ATATATTACT AATGAATTTG 3 720 

TTGAAAAAAT AATAGATCGT TTCAACAATT 37 80 

GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3 84 0 

TTTATAATAA AGATTTAGAA CGCACTGGTT 3 900 

ATGATGATAC AGCAGACCAA GAAACTGTTG 3 96 0 

TAAAAGCGTT TTTAAGTGAA TATGCTAAAG 4 02 0 

AAG CGTACAA TTCACTTTCA TAATTTATTG 4 080 

TTATTCGTTA ACAATCAGTA ATAATGTAGT 4140 

TGAACATAAA AAATACTCCC TTCAACATAG 4 200 

TATTTTTGCG TTAATAATTT GTTTTATTTT 4 26 0 

GGTCATTACT AAAACAAATT CATATTAACT 4 320 
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 44 4 0 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CGTCATTATG TGACATACCA 4 5 00 

GTATCTGGTA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4 560 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4 620 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4 680 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4 740 

TCTTTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTT CAGAAGA TGCAGATGTT 4 8 00 

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4 860 

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGT CGT ACTTTCTTCT 4 92 0 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4 980 

TCAGTTGTtG ATT CAAC CAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 504 0 

GCTTTATTTT CAGATTTTGG TTGTGCAACC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTATTTACTT CAGCATTTTG TTTTGAATCA TTTACAGATG CATTATCTTT GCTAT CAGCA 5160 

GATGATGCTG CTTCTGTGCT CGCAGTTGTT GGAGCCGTTG CTGTTGATCC TGTTGGTGCA 5220 

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 52 80 

TTATCAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 534 0 

TTACTAGTTA CATCGTTATC CATTGTCGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTG 54 00 

TTATTAGCTT GCGGTTTATC ATTTG CATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 5460 

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTG CT 5580 

GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 5640 

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TG CATTTGTT 5700 

TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760 

ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5 820 

ACTTCTCTGA TTGCTTCCAA AGCATTTTT C TTAACTTCAG GATTAATACG TTGTGCTTTA 5880 

AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5 940 

ATATCTTCAG TTG CTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 

TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6 060 
(2) INFORMATION FOR SEQ ID NO: 535: 



EP 0 786 519 A2 



(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 5: 
AACAAAGCCT TCCAATTATC TGCGTCGGTA GAACAAGTAT TAGCAA CTTT ATCACCTACG 60 

CTAAACAGT C CTTACGATTT ATACGGCACG ACAAAAATGC TAGATATTAC ATTCGATTCA 120 

TTTGAACATG ATGGTACAAC GTACCCTGTC GACTATGCTA CGTTTGAAAA TGATTATGAA 180 

GATAATAAAG ATCCTGAGTT TAGACGTAAA AGTTTCAAAT CGTTTAGCGA TGGGATTCGA 240 

AAATATCAGC ATACTACCGC GGCTACATAT AATATGCAAG TACAACAAGA AAAAATTGAA 300 

GCTGATTTAC GTGGATTTGA ATCAGTCATC GATTATTTAT TACATAGTCA AGAAGTAACG 36 0 

CGTGATATGT TTGACCGTCA AATCGATATG ATTATGCGTG ACTTGGCACC AGTTATGCAG 4 20 

AAATATG CTA AACTTTTACA ACGTATT CAC GGATTAGATA ACATGCG TTT TGAAGACTTG 4 80 

AAGATTTCTG TAGACCCTGA TTATGAACCA GAGATTTCAA TTGAAGACTC AAAAAATTAT 54 0 

ATTTTCGGTG CGTTAAGTGT TTTAGGTGAT GACTATACAA ACATGTTACG TGAAGCATAC 600 

GATCAGCGAT GGaTTGATTT TGCACAAAAT AAAGGTAAAG ATACAGGCGC ATTTTGTGCA 660 

AGTCCATACT TTACACATTC ATATGTGTTT ATTTCTTGGA CTGGTAAAAT GGCTGAAGCA 720 

TTTGTCTTAG CACATGAATT AGGTCATGCA GGTCATTTTA CATTAGCTCA AAAACATCAA 7 80 

CCATATCTTG AATCAGAAGC ATCAATGTAC TTTGTTGAAG CCCCTTCTAC AATGAATGAA 84 0 

ATGTTGATGG CCAATTATTT ATTTAACACA AGTGATAATC CAAGATTTAA GCGTtGGGTT 900 

ATTGGCTCAA TTTTATCTAG AACATATTAT CATAATATGG tACCOmTTTA TTAGAAGCnG 9 60 

CTTATCCACG GGGAGTG 977 
(2) INFORMATION FOR SEQ ID NO: 536: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 
AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 

TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATT CATCT AACGCTGCAC CTCGTAATTT 120 
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AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 24 0 

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300 

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 3 60 

TAATTCACGG AACGGTGTTT TCGTTTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 420 

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 4 80 

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 54 0 

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 6 00 

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 6 60 

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 72 0 

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 780 

TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 84 0 

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATG CAT GTCGATCGCG TTGCAATGTT 900 

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 9 60 

GGAGGAATGT TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 102 0 

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCATTTTG 1080 

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 114 0 

TTATAATTTT TTGCTTTTTC AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 12 00 

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 12 6 0 

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTT GT CTAGTAAAGG 1320 

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 1380 

CGTTTTGACG AATTTTTTGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 144 0 
(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 53 7: 

GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 60 

TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 120 
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CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 24 0 

TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 3 00 

S 

AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 360 

CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 420 

TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTTT 4 80 

10 

CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 540 

ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 600 

1S CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 660 

TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 72 0 

TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCAC CT ACGCnAACCA 7 80 

20 TATC 7 84 
(2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 
CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 60 
CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 120 

35 

CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 180 
AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 240 

40 TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 3 00 

AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 3 60 

CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 420 

45 CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 4 80 

TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 540 
GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 600 

SO 

GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 660 
ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 720 
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AGTATACCAT 


TAATTTCAAA 


ATGACT CATA 


GCAATTCATT 


TTATACTATA 


AAATTTACAT 






GTATACTTTT 


ACGTTAGATT 


TCATT A CACA 


TATTTGCATT 


CAAATAACGA 


AA CG CTTTTA 




5 


ATAATTACTA 


AGGGGGAATT 


GATATGATTA 


GATACGCTAA 


AAAAGAGGAT 


TTAAACGCTA 






TATTAG CGAT 


ATACAATGAT 


GCCATTATCA AT ACTA CAGC 




TATGAACCAC 






AAACCATAGA 


CGAACGTGTC 


GCATGGTTTG 


AAACGAAACA 


ACGTAAGCAT 


GAGCCTATCT 




10 




GGAAAATGGA 


AGTGTCTTAG 


GGTTTGCGAC 










GGCCAGCATA 


CCTATATACA 


ATCGAACATT 


CTATTTATGT 


CGATGCTTCA 


G CTAGAGG AA 




15 


AAGGTATTGC 


TAGTCAATTA 


CTACACCATT 


TAATTGTGGA 










GTGCGCTAGT 


TG CAGGCATT 


GATGCTTCCA 


ACAAAGCGAG 












GCATGCCGGC 


ACACTGACCA 


ATGTAGGTTT 












TTACGAATTA 


GATTTACAAG 


ACTAGTAATG 


TTTG AAT CA C 








CAAGACAACC 


ATGTTAATTC 


CCTTAACATA 


ACAAGCCAAC 


ATATAAAATT 


TTAAACTTCT 






CAGGGGAGTG 


GGACAGAAAT 


GATAAAGAGC 


CACTAATGAT 








25 


CACATTAGCC 


ACAG CTAATG 


TGTAcTTAAA 


AATAGGAATA 


CATGAGTAAA 


ACTC ATG CAT 






AAGAAATACT 


AATTTCTATA 


GAAAAAGTAT 


TTCTTTATCG 


TCGTCCCACC 


CCAACTCGCA 






CATTATTGTA 


AGCTGACTTT TCGTCAgCTT CTGTGTTGGG 


GCCCAAAAAG 


CTTGTTACAA 




30 




GTTCAGTCAA 


CTACTGCCAA 


TATAACTTTG 


TAGAG CAT AT 








TACATTGTCC 


CTTTTATTTA 


TTCTTTTCAA 


ATACTATCCC 








35 




AACAAAACGA 


ATATAGTAGA 


ACATGAAAAC 


GATAATCATG 


CTGAGCGATA 






AAGATTTAAA 


TAATAGATTG 


ACCCACGTTC 


CCTCAGTCGT 


ATATCCATAT 


GTAATCGTTG 






TGTTAATGAT 


GAATGCTATA 


AAGATGATTG 


ATAGTCTTAG 


CATATCATCA 






40 


GTTATTTTAG 


ATATACGGGG 


GCGCTTTTGC 


AATCACTATT 


TTGATTAGTA 








ATAAATCTTT 


CAACTTCTTC 


AGAGATAATT 


AAGAAGCATC 


TATCTGGTAC 


TAATGATCCA 






GACAGATGCT 


TCTTTTTTAT 


CAATATTTTA 


TTGTTATCTC 




TTTAACCATA 






TCTTCAGCTG 


TGC CAAAGAT 


TTTACGTTTA ATTGCTTCGC 


CAGTTGGTGT 


GCCTGCTAGT 


2280 




CCACCCAATC 


CAGTTTCACG 


TAATGATGCA 


GGAAGGTTAC 


GACCAACCTT 


ATCCATTGCT 


2340 




TCAATAACTT 


CATCAACAGG 


GATTCTACTT 


TCAATACCTG 


CTAATGCTAA 


ATCTGCTGAA 


2400 


SO 


ATTAAAGCGT 


TACCCGAACC 


AATTGCATTT 


CTCATAACAC 


AAGGAATTTC 


AACAAGTCCG 


2460 




GCTACTGGAT 


CACAAACTAA 


ACCTAATAAA 


TTACTTATCG 


CTAATGCCAT 


AGCGTGCCCG 


2520 
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTACG 264 0 

ACACGCCCAA ACAATG CTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 2700 

TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 27 6 0 

GTCGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 2 82 0 

CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 2 8 80 

CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 2 94 0 

TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTG CGCTCGCACT TCATCACGTG 3 0 00 

ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATATTATTTT 306 0 

CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 3120 

ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 3180 

ATTGCTTTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 32 4 0 

TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTGCTTTA 33 00 

GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 3360 

TTTATATGAA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 34 20 

ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 34 80 

CTATCGCCTT CTTCTTCGAT GATATCTATT TT AAT AC CAT CATCAGCTGC AATTTCTAAT 3 54 0 

GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 3 600 

TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 3660 

TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTTTAC TGCACCAGCC GTATGAGAAC 37 20 

TTGAAGGGCC CAT 37 33 
(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 525 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 

TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 60 

GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 120 

TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 180 
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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 36 0 

5 CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAa GAGGAA AaTGTAaGaT 42 0 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 4 80 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 52 5 

10 

(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 08 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 



TTGATTTGCT 


ACAAAGTATC 


TnCTCATTTC 


TGTATCCTGA 


AAAATCTTTA 


GTGTAATAAT 




GTTGTTCAGT 


TTTAATATTT 


TCAGTCATAG 


TTGACTACCT 


CCGTATATTT 


TGATTTAATT 


120 


AAGTTGTATA 


TTTTGATGAA 


CACTTATTGT 


TACTTGTTGG 


CGCAAGTAGC 


AGTTTTTTCA 


180 


TTCTTCATAA 


AAGTATTCCT 


TATAGAATAT 


GAATGTTGCG 


ATACTTGCGA 


ATCCTGCAAT 


240 


TGaCCATGCT 


GtAGTGAAGT 


ATAGAAACGG 


CATAAGTACA 


ATCGCTAAGA 


CTGTGAAGCA 


300 


TAGTACTGCT 


ACTAGGTAGC 


TTTTATAAAT 


GTTACTCATT 


TTCTTTTTTC 


AACTCCTCCA 


360 


TTATTCTCTG 


GTCTGATAAG 


TCGTGATAAG 


GGAATTTTTT 


CCtAGCTAAT 


TGGACTGGTA 


420 


TTCTGCCTCG 


TATCGCAATG 


TATCCTTCAT 


CTTCAAGCTC 


TTTATTCAGT 


TCTCTTATTA 


480 


TTTGTCCTGC 


TTTGGATTTA 


GAAACAGATA 


AAATTACCgC 


AAGTTCTTTA 


GCTTGCAAAC 


540 


TATTTTTCAT 


CATATCTTTT 


CCTCCTTTAA 


AATAACTGTT 


GATTCTCTGG 


GTTATCTGCT 


600 


TCGTAATTAT 


CTGCAATAAT 


ACTTTTAGCG 


AAAAAGTCCA 


AACTGACCTT 


ATATAGGTTG 


660 


TTCATAGATT 


TCTTTACGTT 


AACCCCTTCC 


TCAAGTACAT 


AAGGCACCCT 


AAAATCATTT 


720 


ATAAACAGTC 


CGTTTTCGTC 


TAAAGTAACG 


GTTGGTAATT 


CAGGTTTGTT 


CCGTCTATAA 


780 


ACTTCTCCTA 


GTGTAGGTTT 


TTGCTTTTCA 


GCTTGTTTAG 


TGAAGTCGGA AAATGCCTTA 


B40 


AGTAGTTTTA 


TTCCTGAATC 


AGGATCACTG 


TGTCGCTCAA 


TCGTTTCTGC 


TGTAGACTCT 


900 


TTACTAAAAT 


CATTTCTATT 


GATTACAGGC 


TTTCTCGTAT 


TTCGTTCAAT 


CTTCCAAACC 


960 


TTCCACGTCA 


CAACTGCCAT 


TGTGGTGAGG 


AGGGTTGTTT 


TGTATAGTGC GTTCATTTGT 


1020 


AATTCCTCCT 


ATTAAGTTGT 


TTGTTCAATT 


GTGTGTGTTA 


TTCTTCTTCG 


TCTAAATCAA 


1080 
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CGACTTCGTA AGTGTGCTCA ATCTCGCCTG cATATGTCAC AGTAAGAGTA TCTTTGTGTG 1200 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 126 0 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AG CyTCATAA CCGAATTCAG 1320 

TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 13 80 

CTTCTCTGCC ATCAGAACGC ACAATTGT 14 0 8 
(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 

GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 60 

AAGCAGTTAT TGACTGAAAA T CTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 120 

TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGG CTTGTT 180 

ATT AG T AAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 240 

TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 300 

CATAC CG ATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 360 

TACAGTGAAA ATTGCTG CTG CTGTTTCTG c ATCAACTGGG nCAACCTCAC CATTAACTGT 420 

TGTTGGACAC AT 432 
(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

ATAATCATGA AGTnG CTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 6 0 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACAr G 12 0 

50 GAAGCGATAA AATTAAT CAT GCTATGATTG AAAAATTAG C TAAAAGTAAT GCCTCAACGC 180 

AACATTACAC ATATCATAAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 24 0 
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GTATAAAAAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 36 0 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 420 

5 

AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AGATCAAATT ACTCGTCATA 4 80 

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 54 0 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAA CGAA ATTAGGAAAA GCTGAAGCAG 6 00 

ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGAC CAAT 66 0 

TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 72 0 

15 TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CG TATAG AAA 78 0 

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AG r AACAG AT CAAAATAAAA 84 0 

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900 

20 AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960 

CAAGTTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTTAGCAA 102 0 

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 108 0 

25 CAGAAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATG CTGAT 114 0 

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 12 00 

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 126 0 

30 

AATAAAG CAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 132 0 

TCTTATTTCC CCTTAATTTA TTG TGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 1380 

CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 144 0 

35 

TGTCTCGAAA AACGGCGCTA TTAGTTTTGG AT ATG CAAGA AGGTATAGCG AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AG AG AG CAAT TGAAGCAGCA AGACAACATC 1560 

40 GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 162 0 

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 1680 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 174 0 

45 GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTG CAAAT GATATTAATC 18 00 

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAG CTGTATT GAG CACGGCA TTAGAAAGTG 186 0 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 

50 AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 204 0 
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GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 216 0 

ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 222 0 

5 CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 22 80 

ATG CATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 23 4 0 

ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 24 00 

10 

ACTTTTTnCA ACAAATGTCT nACATG 2426 
(2) INFORMATION FOR SEQ ID NO: 543: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 60 

25 GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 12 0 

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 180 

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 240 

30 

ATG CATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 300 

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 36 0 

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 420 

35 

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480 

TAGGTTTTCA CC CGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 54 0 

4Q TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 60 0 

TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660 

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 72 0 

45 ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780 

GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 84 0 

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 90 0 

50 GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 96 0 

ATATTC CAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 102 0 

55 
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TACCTTTAAT ATTTTCTAGT ATCGAAAGAA TTGACACTAT TAGTACTGCT ATGGAGTTAA 114 0 

GACAATTCGG ACAGTATAAA AGGAGAACCT GGTACGTCAA AAAACAATTA AAAAAAGATG 12 00 

ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTCTGATGTT AGTAGTTACA TTATT CTTTT 12 60 

TAAATAATAG TCGATATTTC AACCCGTGGC ATTAGTATTC ATATAAATAG TCTTTAAATA 13 20 

GAAATAGGAG GGAGACATTT AATGATAAAT ACTGAAAGAT TAAATTTAAT GATTCCAAGT 13 80 

TCCTCGCATT TAATTGAACT TTATAATATT TGTAGTCATC CACAAGCAAA TATATACACT 14 4 0 

CCCAAAGGTT TACATAATTC CAAATTAGAC ACACAACGGT GGATTGAAAA ATGGCGAAAC 1500 

CATTGGCAAC AATATCAATT TGGTTACTTT GTATTGGTAA AAAAAATAGA TTGTAGTGTT 156 0 

ATTGGTATTT GTGGATATGA ATATCGACAA TTAAAGCAAG AaACAGTATT AAATTTATTT 162 0 

TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTACGCAT GTGAGGCTAT TACAGCAATC 16 8 0 

ACAAATTTTG TGAATTATAT CGATCAAGAA ACAGTAAAAG TTATCAGGAC AAATAAGTGT 174 0 

AACCAACGTT CAATAAATTT AGCAGAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800 

GACATTATCA ATCAAGGAGA TATTGTG TTT TAAnAATAAA ATACTATGAC ATTATCTAAA 18 60 

AAATAAAATT AAAA 1874 
(2) INFORMATION FOR SEQ ID NO: 544: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 
TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 60 

ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 120 

TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180 

AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT ATTATTAATG 24 0 

TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 300 

TAATC aGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 3 60 

AATAAAAATG G T AT AAAGTG TGATTTGTAT AAAAAAGAGT CT CGACGGAT AAGAATTGAT 420 

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAAATGAGG 4 80 

TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA TATTAC GTAA 54 0 
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AAGCAAGTTT AG CATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT GTACAATCAG 6 60 

TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 7 20 

AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 7 80 

ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 84 0 

ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 900 
GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 96 0 

TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 1020 

ATTTTTACTA CATTTCAAAA GAAGACATCT CATTAAAAGA ACTTGATTAT AGATTAAGAG 10 80 

AACGTGCGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 114 0 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 12 00 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 12 60 

ACTTTCTAAC AACAAAGCGC TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 13 20 

AGCGACGTAC TGTTGCTTAG CTTCTTTTTT TGAGGGGAAA AGTTACAAAA CTCACACAAA 13 80 

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 14 4 0 

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 15 00 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 156 0 

TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TG T AAA CACT ACAACGGAAA 162 0 

AACCAGTTCA TGCCGAAAAG AAACCTATTG TAAT AAGTG a AAATAGCAAA AAATTAAAAG 16 80 

CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 174 0 

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 180 0 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATTTTACG 18 60 

TTAATGAGGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920 

ACGrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980 

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AG CGATGTCA TTGAAAGAGA 204 0 

TTGATTTTAA ATTAAGAAAA TAC CTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 2100 

CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 2160 

AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 2220 

TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 22 80 

AG AGG TTAAG TGACGAT CAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTT CA 234 0 



1480 



EP0 786 519 A2 



TTAATAATAC 


TTCAATAATT 


GTTAAAAGGG 


GTTTAATGTG ATTATCTTAG 


AACGCCATCT 


2460 


ATAATGATGT 


TGTATGATTC 


AAATTACGTA 


AAAAGACAAT CGAATATAAT 


ATAGATTGGA 


2520 


GCATACAATT 


ATGAAAATGA 


GAACAATTGC 


TAAAACCAGT TTAGCACTAG 


GGCTTTTAAC 


2580 


AACAGGCGCA 


ATTACAG TAA 


CGACGCAATC 


GGTCAAAGCA GAAAAAATAC 


AATCAACTAA 


2640 


AGTTGACAAA 


GTACCAACGC 


TTAAAGCAGA 


GCGaTTAGCA ATGATAAACA 


TAACAGCAGG 


2700 


TGCAAATTCA 


GCGACAACAC 


AAGCAGCTAA 


CACAAGACAA GAACGCACGC 


CTAAACTCGA 


2760 


AAAGGCACCA 


AATACTAATG 


AGGAAAAAAC 


CTCAGCTTCC AAAATAGAAA 


AAATATCACA 


2820 


ACCTAAACAA 


GAAGAGCAGA 


AAACGCTTAA 


TATATCAGCA ACGCCAGCGC 


CTAAACAAGA 


2880 


ACAATCACAA 


ACGACAACCG 


AATCCACAAC 


GCCGAAAACT AAAGTGACAA 


CACCTCCATC 


2940 


AACAAACACG 


CCACAACCAA 


TGCAATCTAC 


TAAATCAGAC ACACCACAAT 


CTCCAACCAT 


3000 


AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAG CGT 


ATTATACAAA 


3060 


ACCGAGTTTT 


GAATTTGAAA 


AGCAGTTTGG 


ATTTATGCTC AAACCATGGA 


CGACGGTTAG 


3120 


GTTTATGAAT 


GTTATTCCAA 


ATAGGTTCAT 


CTATAAAATA GCTTTAGTTG 


GAAAAGATGA 


3180 


GAAAAAATAT 


AAAGATGGAC 


CTTACGATAA 


TATCGATGTA TTTATCGTTT 


TAGAAGACAA 


3240 


TAAATATCAA 


TTGAAAAAAT 


ATTCTGTCGG 


TGGCATCACG AAGACTAATA 


GTAAAAAAGT 


3300 


TAATCACAAA 


GTAGAATTAA 


GCATTACTAA 


AAAAGATAAT CAAGGTATGA 


TTTCACGCGA 


3360 


TGTTTCAGAA 


TACATGATTA 


CTAAGGAAGA 


GATTTCCTTG AAAGAGC TTG 


ATTTTAAATT 


3420 


GAGAAAACAA 


CTTATTGAAA 


AACATAATCT 


TTACGGTAAC ATGGGTTCAG 


GAACAATCGT 


3480 


TATTAAAATG 


AAAAACGGTG 


GGAAATATAC 


GTTTGAATTA CACAAAAAAC 


TGCAAGAGCA 


3540 


TCGTATGGCA GACGTCATAG 


ATGGCACTAA 


TATTGATAAC ATTGAAG TGA 


ATATAAAATA 


3600 


ATCATGACAT 


TCTCTAAATA 


GAAGCTGTCA 


TCGGAAAAAC AAGAAGTTAA 


GTGACAACGG 


3660 


TTTACATGTT 


GCTTAGCTTC 


TTTTATTATG 


CGTAATGATG TAAAAAGACG 


AATATTCATT 


3720 


TGTTTGTAAA 


AGTGGCATTT 


CTATGTCTTA 


AAAGTGACGA AACTTCAAAT 


GTGCCAAGTG 


3780 


TTGAATCACA 


TCAAAATCAT 


TTTTATTTAA 


CGAACATTAT GGATTTCTTA 


ATTTACTTAA 


3840 


CGATGATTCA 


AATATAGTTA 


AACAAGGTTT 


AATGTGAATG GAGCAATACG 


CCATCTATAA 




TAAAGCTGTA 


TGATTCAATG 


AATGTAATCG 


AACAAATCTA ATAATTACGA 


ATGGAGCATA 


3960 


CAACTATGAA 


AATAACAACG 


ATTGCTAAAA 


C AAGTTTAG C ACTAGGCCTT 


TTAACAACAG 


4020 


GTG TAATCAC 


AACGACAACG 


CAAGCAGCAA ACGCGACAAC ACTATCTTCC 


ACTAAAGTGG 


4080 


AAGCACCACA 


ATCAACACCG 


CCCTCAACTA 


AAATAGAAGC ACCG CAATCA 


AAACCAAACG 


4140 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 4 260 

C T AAAT CAG A CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 4 320 

AATT TAAAG A TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 4 380 

GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATTATTTCA 4 44 0 

TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA TGGTGAAGGA GTACATAGGA 4 500 

ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA TCTGGAAAAA TATTCTGTCG 4 560 

GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 4 620 

AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 4 680 

AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATT GAA AAAAATAATC 4 74 0 

TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 4 80 0 

CGTTTGAATT GCACAAAAAA TTACAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 4 860 

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 4 920 

ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 4 980 

GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 5 04 0 

AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 5100 

CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 5160 

AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 5220 

AACAAATCTA ATAATTACGA ATGGAGCATA CAACTATGAA AATGACAGCA ATTGCGAAAG 5280 
(2) INFORMATION FOR SEQ ID NO: 545: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 54 5: 
AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 60 

AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA T AAA CAG CCA 120 

GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAG CAATGCC TCAAACCAmA 180 

TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 24 0 

AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 3 00 
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GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTGCT 4 20 

AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 4 80 

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 54 0 

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC AT CG CGT AAA AAAACCAGCA GOO 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGG AT TA 6 60 

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 72 0 

CAATAATGCA CTTAAAGTTT TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 78 0 

ATTACtTAAA ACGGGgTt Cy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 84 0 

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 88 6 
(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 



30 


GGCATTTGTG 


TCCTTATATA 


AGGAACTGTG 


t T AAATA CAT 


TACTGTTGTT 


AAGTTGTTTT 


60 




TGTAATTCAA 


AGAGCAGAAC 


AGAGTAACAT 


CATCAGTTGT 


AGTAAACGAT 


AATCCGGTAA 


120 




AACAACTAAA 


TGAAATAATG 


AAAGTCATTT 


AACCTGAACA 


TTAAAATATA 


TTTGTTTTTC 


180 


35 


ATTAAGAATA 


ATTCAAGTAT 


ATTTAAATCG 


AGGTTAATTA 


TCGTATGAAA 


CGATGCACGT 


240 




TATAATAAAA 


ATGTATGATT 


CAAATTACGT 


AATGAAAACA 


ATCCAATATA 


TTAAGA TTGG 


300 




AGCAAATAAA 


TATGAAATTT 


ACAGCATTAG 


CAAAAGCGAC 


ATTAGCTTTA 


GGAATTTTAA 


360 




CAACAGGAAC 


TTTAACAACA 


GAAGTTCATT 


CAGGTCATGC 


AAAACAAAAT 


CAAAAGTCAG 


420 




TAAATAAACA 


TGACAAGGAA 


GCATTATACC 


GATACTACAC 


TGGAAAGACT 


ATGGAAATGA 


480 


45 


AAAATATTAG 


TGCTTTGAAA 


CATGGTAAAA 


ACAACTTACG 


TTTTAAGTTT 


AGAGGTATTA 


54 0 


AGATTCAAGT 


TTTACTGCCT 


GGAAATGATA 


AAAGTAAATT 


TCAACAGCGT 


AGTTATGAGG 


600 




GGTTAGATGT 


TTTCTTTGTT 


CAAGAAAAAA 


GAGATAAGCA 


CGATATATTT 


TATACTGTTG 


660 


50 


GTGGTGTAAT 


ACAGAATAAT 


AAAACATCTG 


GAGTTGTCAG 


TGCACCAATA 


TTAAATATTT 


720 




CAAAAGAAAA 


GGGTGAAGAT 


GCTTTTGTGA 


AAGGTTACCC 


TTATTACATT AAAAAAGAAA 


780 




AAATAACACT 


AAAAGAACTG 


GATTATAAGT 


TGAGAAAGCA 


TCTAATTGAA AAATACGGAC 


840 
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ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA 960 

AACAAATTAA AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAGTAAT 102 0 

5 TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGG ATG CAT 10 BO 

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTGTCA TGAATCCTAT CCCAATCTCC 114 0 

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCG CAACA TAAC CAAATG 12 00 

10 

TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAAT CACAT 12 SO 

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 1320 

,5 TGCGAATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 1380 

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 14 4 0 

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 15 00 

20 AGAATTAATT GACCAAGTCG GTTACTTCAT TGAGCCAGAA GATTTATTCA GTGCGATGAT 15 60 

TCGTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCGTAAAGT 1620 

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 16 80 

25 

GGATTTGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 1740 

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 18 00 

3Q AGGTGATGCA TATGAATTCC TAATTGGGCG CTTTGCGGCG ACAgCGGGTA AAAAAGCAGG 18 6 0 

CGAGTTCTAT ACACCACAAC AAGTAT CTAA GATACTGGCG AAGATTGTCA CAGACGGTAA 1920 

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 1980 

35 TGGTAAAGAA ACACAAGTGT ATCGTTATTT CGGTCAAGAA CGTAACAATA CTACATACAA 204 0 

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCGTAA 2100 

TGATGACACA TTGGAAAACC CAGCCTTTTT AGGCAATACA TTTGATGCGG TTATTGCGAA 2160 

40 

CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 22 2 0 

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 22 8 0 

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 234 0 

45 

AGGTGCTGCA GAAGGTGTCA TTCGTCGTTA TTTAATTGAA GAAAAGAACT ACTTAGAAGC 24 00 

TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 24 60 

50 ATTTAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 25 20 

TGAAAAAGGA AAAAATCAAA AT CATTTAAG CGATGCCCAA GTCGAACGTA TTATAGACAC 25 8 0 

ATATAAGCGT AAGGAAACAA TTGATAAATA TAGCTACAGC GCGACACTAC AAGAGATTGC 2 64 0 

55 
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GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 2 760 

TGAGCAAGAA ATCAATG CAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2 8 20 

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2 880 

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2 94 0 

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3 000 

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3060 

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120 

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180 

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 324 0 

GTGCAAGAAA T CAC GG ATT A TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3 3 00 

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360 

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 34 20 

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 34 80 

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3 54 0 

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3 600 

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660 

ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720 

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780 

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3 84 0 

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3 900 

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3 960 

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4 020 

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4 080 

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140 

TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 4 200 

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4 260 

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4 32 0 

TTGCTCATTA TAATGA 4 336 
(2) INFORMATION FOR SEQ ID NO: 547: 
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(A) LENGTH: 487 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 
TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60 

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 120 

ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 18 0 

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 24 0 

AACATCTTGA C CAT CTAAAT CATCTGTTAA T AATTC CGGA GCAGGTACAT TAAATGTATC 3 00 

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 360 

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 4 20 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 4 80 

TTCTTTA 4 87 



(2) INFORMATION FOR SEQ ID NO: 54 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 8: 



TTGGTGGTGC 


AGCAGTTATA 


GCAATTGTTT 


TAGCATTCAT 


TGGTAAGTTC 


ACTGCATTAA 


60 


TTTCTTCTAT 


ACCTACmCCA 


GTTATGGGAG 


GAGTATCTAT 


ATTACTTTTC 


GGTATTATTG 


120 


CAGCAAGTGG 


CTTAAGAATG 


TTAGTTGAAA 


GCAAAGTAGA 


TTTTGCGAAC 


AATCGAAATT 


180 


TAGTTATAGC 


TTCTGTAATT 


TTAGTTGTAG 


GTATCGGTAA 


TTTAGTATTT 


AACTTAAAAG 


240 


AAATTGGTAT 


CAACCTTCAA ATTGAGGGGA 


TGGCATTAGC 


TGCACTTTCA 


GGAATTATTT 


300 


TGAACTTAAT 


CTTACCTAAA 


GAGAAAAAAC 


AAAACAATTA 


AGATTTACAA 


ATTAAGGAGG 


360 


GCGCTTTTAT 


GAATCATTTA 


TTATCAATGG 


AACATTTATC 


TACAGATCAA 


ATATACAAAC 


420 


TTATCCAAAA 


GGCAAGTCAA 


TTTAAATCTG 


GTGAACGTCA 


ACTACCAAAC 


TTTGAAGGGA 


480 


AaTATGTCGC 


AAATTTATTC 


TTTGAAAATT 


CTACTCGrAC 


AAAATGTAGT 


TTTGAAATGG 


540 


CAGAACTTAA 


GCTAGGGTTA 


AAAACGATTA 


GCTTTGAAAC 


ATCAACATCA TCTGTTTCAA 


600 
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 720 

CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 780 

TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 84 0 

ATTCACGTGT CGCACtAGTA ATTACCAnAG T 8 71 



(2) INFORMATION FOR SEQ ID NO: 54 9: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
, 5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 9: 

20 TCAATCTAAT AT ATACTICA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 6 0 

TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 120 

ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 180 

25 CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT 24 0 

TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 3 00 

CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 3 60 

30 

CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 4 00 
(2) INFORMATION FOR SEQ ID NO: 550: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 
ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 6 0 

45 

ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 120 
CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 180 
TGTTT AAA CC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 24 0 

so 

AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 3 00 

TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 3 60 
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TTTGTTTTAA TGCATCAATG AGTGCTGTTT GATTTTCAAC AATTGGACCT GGCAACTCTT 4 80 

TTTTATAATC CATGTAAAAA CCTCTAAGCT CATCGCCATA TTTATCTAAG TCATATGCAT 54 0 

5 

AGAAAATTTG CGGACGCTTT AATACACCGA AGTCG AACAT GACAGATGAG TAGTCGGTAA S00 

CTAACGCATC GCTGATTAAG TATAAATCCG AAATGTCTTC ATAAt CTGAA ACGTCTTTCA 66 0 

CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA ATGCATGCGT AAkAAAATAA 72 0 

CATArTCATC ATCCAGCGCT TGACGCAAAG CTTCTATATC AAAGTTAACA TTAAATTGAT 780 

ATGAACCTTC TCGAATCGCT TCATCGTCAC GCCAAGTTGG CGCGTACATA ATCACTTTTT 84 0 

,5 TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT ATATTCAGTA TCATTGCGTT 900 

TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA AATCTTATCT CGACT AACAT 960 

GAAATGCATT TTGAAATATC GATGTCGAAT ATGGATTAGG TGACACTAGA TAATCCCACC 102 0 

20 GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT ATTTGTTCCT AGCATTTTAA 1080 

CGTTACTAAT ATCCAAACCA ATCTTTTTTA ATGGCGTGCC ATGCCATGTT TGTAAGTACG 114 0 

TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT GTTAATCATC CwCGCTTTCG 12 00 

25 

CTCTTGGCAT CGCTAAAAAC CATTTCATTG AAAACTTTGT AACATATGGT ACATTGTGCT 1260 

GTTGGAATAT GTGTTCATAT CCTTTTTTCA CACCCCATAT TAATTGGGCA TCGCTATGTT 1320 

CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA TTGTTTACCA TGAAAGCTTT 1380 

30 

CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA TTTAAAAGTC GTATATATAC 144 0 

TATGTTCTAT CAATTTTTTA ATTGTATTTT TAAT CATGTC GTACCTCCGA CGTGTTTTTG 15 00 

35 TAATTATATT AATATGTATG AGC 1523 

(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS -. 

(A) LENGTH: 4 923 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTG CTT TTTTAAATTT AATTT CGAGA 60 

50 TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 12 0 

CAAAGTCtTC TTCAACAATT TCAATGTCTG CAT CAGAT CT ATGATATGTG AAAAAGCTAA 18 0 

TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 24 0 
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ATGTGGACAA TAAATCTTTT TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 360 

TATAACGTGC TTTTTTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 420 

5 CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 4 80 

AACGCGACAA TTTGTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTCTGA 54 0 

AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATGTTTTTT TGTAATAGGA CGTTCTTCGC 600 

w 

TATAGTAATC ATAAATTTTC TCTAACTTTT CGACACGTTG TTTTAAATTA TGACTATCTT 66 0 

GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAAC CAC ATAAAAATAT 72 0 

J5 CTGAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GG TAG AAA CA 7 80 

CAAAGTTAAC TAGAGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGG 84 0 

CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900 

20 AAGTAGCAAC ATGTGCTAAA TTAAATTTAA ATAAAAT AG C AATAAGTACT ATGACGGTAA 96 0 

CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 102 0 

CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 10 80 

25 GCATTGCAcA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 114 0 

AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 12 00 

TCTCAAAAAT AATGACGCCC CTTTTAAGTT TATTGAATAT CTAATATTCG TATTCATTAC 126 0 

30 

TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 13 2 0 

CATAATACTG AT AG CGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 13 80 

35 ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAATGAA 14 4 0 

TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC CTTAAAAGTT 1500 

ATTTCATTTG ACTAAAAGCA TAGTCTGCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560 

40 TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAGAAGG TGCTAAATTG ATACCTTGAT 1620 

TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 16 8 0 

AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA CGATTAATTG 174 0 

45 TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAG TAAACC TTCTTCTAGT TGTTGGCCTA 18 00 

AGCTGTCTAA TTTTTCATAA ACACCGTCTT GTTCTAGTAC TTCGAGTAAT GCAATACCTG 186 0 

CTTT CATAG A TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920 

50 

CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980 

CAATTTTACC AAATG CAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 2 04 0 

SS 
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AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGG CAT TACCATTCCA AAGTTACCAA 2160 

CAATTGGTTC TACTAATACT GCGGCAATTT CATCACCCCA AAATTCAATT GCTTCTTTAT 2220 

AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 2280 

CAGCTGAGTC TGGAGAACCG AG CTGAG ATG GGCCGCTACC TGCTGCAACC AATACTAAAT 234 0 

CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA GTATATGCAC 24 0 0 

GTGCAACACG AATTGTTGTC ATGACTGCTT CTGTTC CAG A ATTTACAAAG CGAATTTTCT 24 6 0 

CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2 520 

TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2 580 

CATGCCCCGT AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTGCCATCGA 2 64 0 

CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCG CCTCCTACAG 27 00 

CTTTATAAGA ACGAGAAGGG GAATTGACAC CGCCTAGAAT ATATTCGTTT GAAAGTTGTT 2 760 

GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 2 820 

TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2 880 

GGAGAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 2 94 0 

ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 3 00 0 

ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 3 06 0 

GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 3120 

TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 3180 

AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3 24 0 

ATGAACAAGG TAAAGTATTA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300 

AACTTAAAAA CATTTTGGGG TGACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3360 

AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420 

ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTTGGTT ATGATGGCGG 34 80 

TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 354 0 

GGGTGTAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3 600 

AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3 660 

TAAAAAGATG AAA CT AT CAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3 720 

TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3 780 

AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3 84 0 
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TGCTGACATT 


ATTATAAATG 


CTTTACCAGA 


AACGCAAGAA 


ACGATTCaTT 


TaCTAAAGAA 


3960 


AAAACATTTT 


GAATTAATGA 


AAGATGAAGC 


ACTTTTTATA AATATAGGAC 


GAGGTAGCAT 


4020 


AGTTAAAGAA 


GCG CTCTTAA 


TAGAAGTATT 


AAAAAGTAAA GTTATTCGAC 


ATG CATATTT 


4080 


AGATGTGTTT 


GAAAATGAAC 


CTTTGAAACC 


TAATCATGAA 


TTATATGAAT 


TGGATAATGT 


414 0 


AACTATAACA 


GCGCATATAA 


CTGGTAATGA 


TTATGAAGCA 


AAGTATGACT 


TATTAGATAT 


4200 


TTTTAAAAAC 


AATCTAGTTA 


ATTTTCTCAA 


TAAGAATGGT 


CTAATTGAGA 


ATGAAGTTGa 


4260 


TGCTAAAAAA 


GGCTATTAAA 


TGArATCATC 


ATGTAAATAT 


TGACACGCGC 


GCAATACTAC 




AGTTATATTT 


aTAGTAAgTt 


AATaATgATT 


ATATAAGAaA 


GATGGTgATA TAGATGAGTG 


43B0 


TTGAAATAGA 


ATCAATTGAA 


CATGAACTAG 


AAGAATCAAT 


TGCATCATTG 


CGACAAGcAG 




GCGTAAGAAT 


TACACCTCAA 


AGACAAGCAA 


TATTACGTTa 


TTTaATTTCT 


TCACATACTs 


4500 


ATCCAACAsC 


TGaTGaAATT 


TATCAAGCAC 


TTTCACCTGa 


TTTTCCAAAT 


ATAAGTGTTG 


4560 


CGACAATATA 


TAATAACTTA 


AGAGTGTTTA 


AAGATATTGG 


AATTGTAAAA 


GAATTAACAT 


4620 


ATGGAGACTC 


ATCAAGTCGA 


TTCG ACT TTa 


ATACACATAA 


TCATTATCAT 


ATTATATGTG 




AACAATGTGG 


TAAGATTGTT 


GATTTTCAAT 


ATCCACAGTT 


AAATGAAATT 


GAAAGATTAG 


4740 


CTCAGCATAT 


G ACT GACTTT 


GACGTAACAC 


ATCATCGAAT 


GGAAATTTAT 


GGAGTTTGTA 


4800 


AAGAATGCCA 


AGATAAATAA 


TTTAACTTTG 


GTAGTATGAC 


AAATTAAAAA 


AGCGTTACTw 


4860 


ACTTCATATA 


AGTAAGCGTA 


ATATTTAAGA 


nGTTAAACGA 


CATGaAAGTt 


GTTTAACTTT 


4920 



TTT 4923 
(2) INFORMATION FOR SEQ ID NO: 55 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 917 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 



TCCGGCTTTA 


AAAACTTTTC 


CCAATTCCAG 


CTTGGGCCTT 


TGGCATTAAT ATTAACCTCC 


60 


TGGTTCGGAT 


TAATTGGGAA 


CCTAACCnTT 


TAGGCAATAA 


TTGGTTTAGG CAACTTCCAA 


120 


ATTGGTGGTT 


CAACCAACGT 


CTTTGGATAC 


CyTG CTCATT 


TAATTCTAAA ATGgTyrGAA 


180 


CGCATTTTGG 


TACCCAAAAt 


GgTGACGTTC 


GTTTGCACGG 


TCTAATAAAT TGTCTAAGTT 


240 


GTCGATTGGT 


TTCATTAATT 


GAa CACCATT 


TTGCTCTTCA 


GCAAGACCTT TGTCTACTTT 


300 



1491 



EP 0 786 519 A2 



GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 420 

TGAAGTTACC ACACGTGTAC GCATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 48 0 

ATATTGATCT TCGTTTACAC CATATTCTTT AAGTG CTTTT GGTGTACTAC CACCACTTTG 54 0 

GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 60 0 

ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 66 0 

AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 72 0 

TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 780 

ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT 84 0 

AAAAAGGGAT ACAATG CT AT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC 90 0 

CTTTTAGAAG TATGAAA 917 
(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 

ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT GO 

TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 120 

AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 180 

TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 24 0 

AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 300 

AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 3 60 

ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 4 20 

ATCTCTTTCT CT 4 32 
(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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TTTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 6 0 

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATAT CTTCCT 120 
5 ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 18 0 

TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 24 0 

TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300 
ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 36 0 

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 42 0 

JS TAG CATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 480 
TTAGATTGTT ATAGGTAATA CCACCTACAA TAACCGTACC TTTATCTAGT ACTTCCGGTC 54 0 

TTTCTGTGGA AGTTCGAATA AGGACAC CAG GGAACTTCAA AATAGACGAC TCTTCTGACA 60 0 

20 ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 66 0 

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 72 0 

TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 78 0 

25 CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 84 0 

GCGCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900 
GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 96 0 

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 102 0 

GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 108 0 

3S CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 114 0 

CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 12 00 

ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCGT TTCTCCAnGt TACTTCCAAC 126 0 

40 TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 132 0 

TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 13 74 
(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 
(CI STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 

55 
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TT AC CTCTAG GCATGTCCCT TTCACGGTTT GCTTATGATA ACGTTATCGA CATTGTTAAA 120 

TTGT C CTTTT TGAAAAAATA ATTCTGGTGA GATAGACGCT ACTTGGaTTT CaTCaGTTTG 180 

TAAtAACGCA GtATAAtTAC CaTTACTAAA tTGaGTTAAT CGTTCaTATA AAGTACTAAT 24 0 

AGGATAATAA ATGTTATCCG TTAAgCGCGC CGTATAGTTC ACTTGATACG TTTCGCCTTC 3 00 

AACAATTGCT TGTTGGACAC GTTTAATATT AGTCATCATA ACCTCAGAAG ATTCAACAAA 3 60 

TGAAAAATGA TACTTTGATA CATAAGAAGT TTGATGTTCA TATGTTGAAT TTATGCTTTC 4 20 

CGCTTTTTCA AAACTATAAG CTG CTGCATA AATATCATCT TTAGCTAATG AATGTGTACA 4 80 

CATAG CATG A TTAAAATACT TTGCCGCTTC GTAACTTAAA TATAACGAGA CATATCTACC 54 0 

TTGTCGTkGt GCGCTTGTGC AAAGTGTATC ACTTCTCCCA CATCAGCCAA CTTAGTAGCA 6 00 

ACATACTTCT TTATAAATCC CTTTAATTGA ATATGGTATT GCTTATATTC ATTTTCAGTT 6 60 

AAATAGTAGC GATAATTATA TTCTATTCTC ACAGTAATCA CCTACCTTCG ATAAAAATAA 72 0 

TTCAACTTGT CGATAACCGT ATTCACTCAA AATAGAT TCA GGATGATATT GCACACCAAA 780 

AACCGGAAAT CTAATATGCT CAAATGC CAT AATAATCGCT TCATCGTTTT TTGCTGTAAT 84 0 

CTTTAAG CAA TTTGGAAAAG TCGCTCCGTC AGCAATTAAT GAATGATAAC GCATTACATT 900 

GAAATTTTGA GGCAGTCCTT GAAAAATACC TTCATTGGTA TGGCGTAACT GTGTAGTATG 960 

TCCGTGTACA GGATGATAGC CGTGAATGAT ATTTCCACCA AAATAAGACA CGATACATTG 102 0 

AAATC CTAAA CATACACCTA GTATAGGTAC ACG CTGATAA AATTGTTCTA ACACTTCATT 1080 

CAAGATAGGA TAATCATCCG GaTTACCCGG CCCAGGCGAA ATAACAATTG CTTTTGG CTT 1140 

CATATTAATG ACGTCTTCTA TCAGCAGATT ATCAATACCA ACAACTTGAA CTGTTAGTTT 12 00 

CGTTTGAGTC TTAATATAGT CTATTAAATT ATATGTAAAT GaATCATTAT TAT CTATGAC 1260 

TAGAATCATT GTATACTCCG TTCTAAATGT GTTTTATTTT TATAATATGT ATTGGATGTA 1320 

GCTAAAACTT TAAAAGCATT GTCATTATCC TGACCTTGAT TTTAACTAAT ATATGGTATA 1380 

TTCTATTCAT CGTACATAAA TGAATATCAG AGGTTCCTAG CTGAAACCCT CTATAAAAAA 1440 

CTAGGC CATT GAAATTTCAA ACATTCGTTG GG 14 72 
(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 12 0 

5 GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGG CACGTA AACATGAAGT 180 

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 24 0 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 300 

w 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 36 0 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 420 

)5 TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 4 80 

GTTTGAATTC GATACGACAT CAGTAGAGGA TTT CATTGCA CGTCAAGAAA TCAATCATAA 54 0 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

20 CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660 

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTG CAA CCTCCAATGA AAGGTAAACA 72 0 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780 

25 CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 84 0 

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 90 0 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960 

30 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020 

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
35 (2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1057 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

45 

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 
AAATTT CTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120 
GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 18 0 

so 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 24 0 

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 

55 



1495 



EP 0 786 519 A2 



AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 420 

TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 4 80 

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 54 0 

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 6 00 

ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 660 

ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG 72 0 

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 780 

AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 84 0 

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 900 

ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 96 0 

GATGaACAGG t TATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 102 0 

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 1057 
(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 558: 

CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60 

TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120 

GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 180 

GATGATTAAG ATTTTTATAG GTGTGCATCA TTT CAAATAA ACTTTGTGTT TAAAATTGAG 240 

CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTG TCT TGTATCCTTA 300 

GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 36 0 

ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGAT a T ATCATATGTr ATTcTACCTG 4 20 

TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 4 80 

CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 54 0 

ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 6 00 

TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660 
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AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 780 

TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 84 0 

AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900 

TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 96 0 

AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 102 0 

TCTACAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 1080 

TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCTnCAATT 114 0 

CTCAATACTT TT CACCTACT TATCAATATA AGGTACTATA TGaGCACCTG GTCTTCGAAT 1200 

AATGGCACTT CCCTTTCTGG TTTCAATATA CACATTTTCG ATATGTATTT TTCCAATAAT 126 0 

TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 13 2 0 

TGTTGACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380 

AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 144 0 

TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500 

CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 156 0 

ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTG CCAACAT GTGGCACTGT 16 2 0 

GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 16 80 

TTTAAATATT TTTCCGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 174 0 

AACATATTTA TATTCGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 18 00 

TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 18 60 

TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 192 0 

CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 1980 

CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040 

ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100 

ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160 

TTTTACTTGT CCCTTATTAG TAACGATTGT CATTGATTTT ATTGATGGAT GCTTAAAAAA 2 2 20 

TGTAAATAAA TCATATTCTG AAAATCCTGA CTGTCCAGGA TGGTTATGTA ACATAACAAT 22 8 0 

TGAATTCGGT TTACTGTTAA ATAATAATTC GGTTGCTTGT TACCCTGGCA CAAAAGATAC 2 34 0 

ACTATCTTGA TTGACATATA CTTTTGTAAA TTTACCATCT TTTAACAAAT AAGCTACTTC 24 00 

ATTGCTATCA TTGTTTTCCA T GAG ATATAC CTC CGTTTAT AGtCTGTCGA CTGATATTCC 24 60 
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GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG 2 580 

AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 2 64 0 

TCTTGTATTT tTTTCTAAAC TAGATTTTTC AG CATTGATA ACTTCAAGAC TATCTATATC 2 7 00 

CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 276 0 

ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820 

TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 2 880 

ATA CTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2 94 0 

TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 3 0 00 

AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3 060 

TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 3120 

ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180 

TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACCTTCTA CTTGATATTT 3 24 0 

ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCG ACATGTGGCA CTGTGGTACT 3300 

TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360 

TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 3420 

TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 3480 

TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 354 0 

AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTcTTGC TTCTCGATAA 3600 

AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3 660 

TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3720 

TAC CGTTACT TATATCATCT TCAGGTATAT ATTC 3754 
(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60 

AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120 
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CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 24 0 

CTGAAATGAA TATTAC GGT A GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 3 00 

5 GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 3 60 

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 4 20 

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTG CT AGATGAACCA TTAAATAATT 4 80 

to 

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 54 0 

ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600 

)£ TTGTAGCATT GAAAAACGGA CAACTAG TTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660 

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 72 0 

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 7 80 

20 ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 815 

(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 
GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60 

3£ CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 12 0 

TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180 
TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 24 0 

40 CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300 

AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 36 0 

TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420 

45 

GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480 
GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 54 0 

GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600 

50 

CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 66 0 

GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720 

55 
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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 84 0 

GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 9 00 

AAATTTGGAT TTCATAAGG 919 
(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 

ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 60 

CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 120 

AGAAGCATGT TCATACTGCA AAT CTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 180 

ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAG CTTGAT TAACAATTGC 24 0 

TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 300 

AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 36 0 

TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TG TGaTTCT A ATGCTAAAAG 42 0 

TTTAAACAAT GCCTTGCnAT CATAGAG CGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 480 

AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 518 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 9 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 

CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTC CGCAT AAAGCTGGTT 60 

TGCaGGCAAA tCAATTACTA CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 120 

AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCAc TGAA TCCAATAATC TTGCTTTAAA 180 

AGGTATTGCC TAT CGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 24 0 

TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA 3 00 
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CAAAGTCGGT TTAGTAACAT GTATGTATGT AAATAATGTA ACTGGACAAA TACAGCCTAT 4 20 

TCCACAAATG GCTAAAGTTA TAAAAAATTA TCCTAAGGCA CATTTTCATG TAGATGCGGT 4 80 

S TCAAGCATTC GGCAAAATTT CAATGGATCT CAATAACATA GATAGTATTA GTTTAAGTGG 54 0 

ACACAAGTTT AATGGTTTAA AAGGACAAGG CGTCTTACTT GTAAATCACA TTCAAAATGT 600 

TGAACCAACT GTCCATGGTG GTGGTCAAGA ATATGGTGTT AGAAGTGGAA CAGTTAATTT 6 60 

GCCAAATGAT ATTGCAATGG TTAAAGCGAT GAAGATAGCT AATGAAAACT TTGAAGCATT 72 0 

GAATGCATTT GTTACTGAGT TAAATAATGA CGTCCGTCAA TTTTTAAATA AATATCATGG 780 

1S AGTTTATATT AATTCTTCAA CTTCAGGTTC ACCATTCGTT TTAAATATTA GTTTTCCTGG 84 0 

CGTAAAAGGT GAAGTATTAG TTAATGCTTT TTCAAAATAT GACATTATGA TATCTACGAC 9 00 

AAGTGCTTGT TCATCTAAAC GTAATAAATT AAATGAAGTA TTGGCTGCAA TGGGATTATC 960 

20 AGACAAATCT ATTGAAGGTA GTATAAGATT ATCATTTGGG G CTA CTACAA CTAAAGAAGA 102 0 

TATAG CGAGG TTTAAAGAAA TATTTATCAT CATTTATGAG GAAATTAAGG AGTTGCTAAA 1080 

ATAAT GAAGT ATGATCACTT GCTTGTTAGA TACGGGGAGT TAACATTAAA GGGTTCAAAT 114 0 

25 AGAAAGAAAT TTGTAAATCA ATTAAGAAAT AATGTAAATA AGTCaTTAAA AGGACTTGAT 12 00 

GGGTTTGTCG TTAAAGGCaA ACGAGATCGT ATGTATATTG AACTTGAAGA CCATGCaGAT 12 60 

ATAAATGAAA TAACATATCG ATTATCAAAA ATTTTCGGTA TTAAATCTAT TAGTCCAGTA 1320 

30 

TTAAAAGTAG AAAAAACAAT AGAGGCAATA AGTGCAGCGG CAATTAAATT gCGCAGaATT 13 80 

TGAAG aAAAC AGCACATTTA AAATTGATGT GAAGCGTGCC CGATTAAAAT TTCCCCAATG 14 4 0 

35 GATACGGTAT GGAATTACAG CGTGGAATTG GGGTGGTGCC AGTATTGGAG CACTTCGCCA 15 00 

TATTTCCAGT GGATGTCCAA CGTCCCAGnC CCAGGAATT 153 9 
(2) INFORMATION FOR SEQ ID NO: 563: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: 

ATAnCGTATA CATGTGTTCT TTAAAATTGT GATAAGGAGT TTAGGATGGT TTATTTAAAA 60 

so 

TCAATAGATG CCATTGGATT TAAGTCTTTT GCAGATCAAA CCAATGTTCA ATT CGATAAA 120 

GGTGTAACTG CAATTGTTGG TCCAAATGGA AG CGGTAAAA GTAATATTAC AG ATG CTATT 180 

55 



1501 



EP 0 786 519 A2 

ATCTTCTCAG GTGCAGAACA TCGCAAAGCT CAAAATTATG CTGAAGTACA GTTAAGATTA 3 00 

GATAATCATT CTAAAAAGCT CAGTGTTGAT GAAAACGAAG TTATTGTAAC AAGAAGATTG 36 0 

TATCGAAGTG GTGAAAGTGA GTACTACATA AATAATGACC GTGCAAGATT AAAAGATATT 42 0 

GCCGATTTAT TTTTAGATTC TGGATTGGGA AAAGAAGCGT ATAGCATTaT CTCGCAAGGT 480 

AGAGTTGATG AAATACTAAA TGCT AAA C CA ATTGATAGAC GTCAAATTAT TGAAGAATCG 54 0 

GCTGGTGTAC TTAAATATAA AAAA CGTAAG GCTGAATCAT TAAATAAACT TGACCAAACA 600 

GAAGATAATT TAACGAGAGT AGAAGACATT TTATATGATT TGGAAGGTCG CGTAGAACCT 66 0 

CTAAAAGAGG AGGCAGCTAT AGCTAAAGAA TATAAGACAC TTTCACATCA AATGAAACAT 72 0 

AGTGACATTG TAGTTACAGT sCACGATATT GATCAATATA CAAATGACAA TAGACAATTA 780 

GATCAACGTT TAAATGATTT ACAAGGCCAA CAAGCAAATA AAGAAGCTGA CAAGCAACGT 84 0 

TTAAGCCAAC AAATTCAACA ATATAAAGGT AAACGTCATC AACTTGATAA TGATGTTGAA 900 

TCgCTTAATT ATCAATTAGT AAAAGCTACG GAAGCCTTTG AAAAATATAC GGGACAATTA 96 0 

AATGTTTT 96 8 
(2) INFORMATION FOR SEQ ID NO: 564: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 

TTGTGTGTAT GCATTCAATG TGCTCTGTTT GTAAATGGCT AG CTATATAA TTTAgGATTC 60 

GAGGATCGTC ATCGACAACA AGACATTGCA CCATAGCTAT AAACTCCCTT ATCTTTTTCA 120 

TTTATTATAC ATGTAAAATA TTTTTGCGTA AAAAAACAAT TGTTCATATT GAGTTCATAT 180 

TTCAACCTTA TACTGACGCT AAAGAAGAAA TAGGGAGAAG TGAATCGATA TGAAATTAGC 24 0 

GATAAAAGAG ATTATGTTTT ACAAATTTCG TTATATTTTA ATCACATTAA TCATTCTTTT 300 

ATTAAGTATT ATGGTGTTAT TTATTAGTGG TTTAGCTCAn GGGCTTGGTA GGGaGAATAT 360 

TTCGTwATTT GAACACTTTG GATAATGATG aaTATGtTGT TCaAAAAATG AAAGAGCCGC 420 

aAATTGaGAA ATCGCA 43 6 

(2) INFORMATION FOR SEQ ID NO : 56 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2554 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

s 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

ATGTTTTGTC AATATCAATT GTTTGAGATA AATCCGCTTG TATAAAACGA ACTTTATCAT SO 

w CATTAAATTT GCTAGTTAAT TCATTTATAT CAGTACGATA ATATTGCACA TAAACTTCAA 120 

ATCCATCAGT TAATAATTGT TTGACTATCT CAGAACCAAT TGAACCAGAA CCACCTAATA 180 

CTAATGCTTT CATTACTTTT TAATCTCCAA ACGACTATCG ACTTGCTGAT CTAAATTTAA 24 0 

15 ATATAACGAT GACGTTTCGT TAATACTATC TAACGTGATA TTTTCAACAA TGTCTAACAT 300 

GTCAAACACG CTAACAC CTT CAAAATACAA TTTAGTATAT TGATTAGCAA TATATTCAGG 360 

TGAGTTTAAA CTTGATATGA ATTCACCTAT AAATTGCTTT TTCAAAAGTT CAAATGCTTC 4 20 

20 

TGCATCTTGG AAA t TG CCTT TTTTATCACG CAACTCATCT AATAATAATT TTTTTAATTT 4 80 

ATCTGGTTCT TCAGTAGCAC TTGT CACGAT TGAAAAACTA TACGTCGGCT CTAGTACAAA 540 

TTGATAACCA AATGT AT CAT CGATAAGTCC TTCGTTTAAT AAATTCTGAT AAAAAT CTGT 600 

25 

TTCTTCCCCA AAAATTAACT CAAAGAATAA TGACATTTCT AAATCACGTT GTACATATTT 660 

TTGAGGCGCT TCTTGTAATG GTTTATTTTT AAAACCAAGC ATTAGTCTTG GTGATTGAAT 720 

30 TTTCATAGAT TCAGTAACAA ATGCTTCTTT AACATCcTCC GGTTCATCAA CAAGTCCTCG 780 

TTCGATTTTG GGTTGGTTAA CTTTATTACG AGCATCCTCG TGTTGTTTTA CTATTCgACA 84 0 

TATTGCTTCA GGATCCACAT CGCCAACAAC AAATAAAACC ATATTTGATG GATGATAAAA 900 

35 CGTTTCATAA CATAGATACA AATCATCTTT TGTAATATCG TATATACTTT CTACACTACC 960 

GGCAATATCA ACACGTATTG GATGTTGTTG ATACATTGCA CGCAAtGTAT TaAACATTAA 1020 

TTTATATCCA GGTTGCTCTT GaTACATTTT TATTTCTTCT GCAATAATAC CTTTTTCTTT 1080 

40 

ATCAACAGTT TCTTTTGTAA AATAAGGCGT TTCaACCATT GTAAGTAAAC GTTTAATGTT 1140 

GTTTT CAATA TtATCAGTTG CACTGAACAA GTAGCTT GTA CGATCAAAGC TTGt AAACGC 1200 

ATTTGCTTGT GCGTTATCTT CAGCAAACGC AGTAAATAAG cTTCTTCTTC TTTTTCAAAT 1260 

45 

AATTTATGTT CcTAAAAAGT GAGCAACTCC ATCAGGTACA GTAACAAATT GGTCTTGTCC 1320 

AAGGGGTTTG AATTGATTAT CTAATGAACC AAATTGTGTA GTGTAAGTGA CAAATGTCTT 1380 

SO TTGAAAACCT GGTTkGGGGA TAATAAATAA TCGTAAACCA TTTTCTAAT t CTTGTTCGAA 144 0 

tACTCTTTCG TCTATTAATT CATAATAACG CTCTTTCATT ATTTATCCCC TCCTTTTGTC 1500 

AACACATAGA TtGTATCTAA AAATGCTTTT TCAGCAACAG AAACAATATC TTCGCGACTT 1560 

55 
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TTATGCATAA TCTCTATAAT ACTTTTCGGA CGATCTTCAG ATTCATATCG ATGAGAAATG 16 80 

ATTACTTTTT TAGCTAACTC TAATTTTTCT TCAGTGAAAT CTCCTGCTTT TATTTTTTCA 174 0 

5 AATTCACTTA TAATAGTGTC TTTTG CAGTT TCGTACTTAT CACTTGAAAC CCCACTCAAA 180 0 

ACAAATAAAT AGCCATTTTT GCCATCAATT TGTGAATGTA TAGAGTACGC TAAACTTTGC 186 0 

TTTTCTCGCA CTTCATTAAA TAAAACAGAT GAAGGATCTC CTCCAAACAT CATGTTAAAT 1920 

10 

ACAACAAAGG CAGCATATCC ACTTTGTCCA TATTGTGTTG GAAAACGGTA TCCCATATTT 1980 

AATTT AG CTT GATCCACGTC ATCATATTCA ACAATATAAT CAACTTCTTC ATCGTGTAAA 204 0 

T5 TGATGAGTAG AATGTTGGAA TTGATGTTTA TCGAATGGTT TAAGTGCAAA TTTTTCACGT 2100 

ATTTGTTTCT CAACACTTTC AGGTTCTACA TTGCCGACAA CATAAACAGA ACATTGATCA 216 0 

TTATTAATCA TTGATTGATA TGTATGATAT AGTGTTTCAG CAGTAATATG TGGGATTTGT 2220 

20 TCTAGTTGTC CTGTAGATAA GTATTTATAT GCTTCATTTT CAAACATATG GTCGAGTAAT 2 28 0 

TTTAAAAACG AATATTGTGC TTTATTATCT ACCATTGCTT CTATTTTTTT GGCTAATAAT 2340 

GTTTTCTCTT GGTTAACAAA ATT AT CATTG AATGCTTTAT TTTCAATTAA TGGATTCCAA 2400 

25 

ATGATTTCTT GtAATAAATC TAATCCTTGa TTAAATAATG AwTCACCGkT TCyTAAATAA 246 0 

CGkkCaTTAA caATTyCTAA tGaAAATGt A ATgACaTGCT GaTCTTTGAA TTTTG AAATT 2520 

3Q GTACTATTCA CATACGCACC ATATAAATCG GCTA 2554 

(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1424 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 

TnTCG CTATT TTnAATTGGT TTTGTATGGT TTAAGTTATA TCAATATACA ACAnACCCTA 60 

4S AAGCTGATAT CCCAGGTATC ATTTTTAGTA CGATTGGTTT TGGTGCTTTG TTATATGGTT 120 

TCTCAGAAGC TGGCAACAAA GGTTGGGGTT CAGTAGAGAT AGAAACAATG TTTGCGATTG 180 

GTATTATCTT TATTATTCTA TTCGTTATTA GAGAATTAAG AATGAAATCA CCAATGTTGA 240 

SO ATTTAGAAGT ATT G AAATT C CCaACATTTA CATTAACAAC AATTATTAAT ATGGTTGTAA 3 00 

TGTTAAGTTT ATATGGTGGT ATGATTTTAT TACCGATTTA TTTACAAAAT TTACGCGGAT 36 0 

TcTCAGCATT AGATTCCGGA TTGTTATTAT TACCTGGTTC TCTAATTATG GGTCTACTAG 420 

55 
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TTGCTGTAAT GACTTATGCA ACATGGGAAT TAACTAAATT 
TGACAATCAT GGG TATCTAT GTACTTCGTT CATTTGGTAT 
5 TGGTAACTGC AGCTATTAAT GCGTTACCGG GACGACTTGC 

TAAATACGAT G CGTCAATT A GCAGGCTCTA TAGGTACAGC 
CAACACAAAC TACACAACAC TTATCAGCTT TTGGGGAAGA 

to 

TTGTACAAGA TCATATGCGT GAATTAGCAT CACAATATGG 

AAGTGTTACT ACAATTTGTA AATAAACTAG CAACGGTTGA 
, 5 TAGTTGCAAC G AT ATTT AG C ATCATCGCCT TAATTTTATG 

AAAAAGCAAA AGCTACAGCT CAAAAGTTAG ATGCAGATAA 

TAAAATAAAT TAATTGAAGT GTGACTAATC AAAAATTATG 
20 AAGTATCGGT GCCAAATATG GTTATCGATA CTTTTTTTAT 

AGAGGAATTA TATTAAAATT TGG CATTG AC GTAGTAGGTC 

AAGTGGGTCA ACGAAATGAA TTTTGTGAAA ATAACATTTC 

AACATTATTT TAAACGAGGC ATG CGATT AC GGAAGAATAA 

TGCTACAGCC AAGTAAAATG ATG CCGTTAT GAATAGCGTC 

3Q CTGGTGGTAC ATTTAAATAA TATTTTTTGA AAACATCTG C 

(2) INFORMATION FOR SEQ ID NO: 56 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 676 base pairs 
35 (b) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 
TCTCCCATGT TCTG CTAAAT GACGCATCAC TTTTACTTCA TGAGGCGTCA ATACACGTCC 60 
TTCACCAGCA TTCAAACCGA CAACATTTAA AGGCCCATAT TCAATACGAG ACAGTTTCGT 120 

45 

CACTTGATGA CCAAAATGTT CGAACATTCT TCTGACTTGG CGATT ACGAC CTTCTGTAAT 180 
TGTAATTTCA ACCAATGTTG TGTTTTTATC TTTATCTTGT TTCTTAACTT TCACTTCAGC 24 0 

SO CGGTTGCGTC ATAC CATCTT CTAATTCAAT ACCTTTTTCT AGCGCTTTCA CTTCTTCTCT 3 00 

CATTAAATAA CCTTTTAATT TCGCAACATA TTTTTTCTTA ATTTGATATC TTGGATGTGT 360 
CATTAAATTA GTAAATT CAC CATCATTTGT GAGTAATAAC AATCCAGAAG TATCATAGTC 420 
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AAATATGGAT ACACCATATA 54 0 

GG CATTT AT A ATG ATG C CAA 6 00 

CTCTCATGGT AATGCTTTCT 66 0 

AATCTTAGTT ACTGTAATGA 720 

GTTAGATAAA ACGAATCCTG 780 

CGGACAAGAA GGCGCAATGA 840 

AGGTATTAAT G ATG CATTT A 900 

TTTATTTTTA CAAAGTAATA 960 

TAGTATCAAT CATGAATAAA 10 20 

TTGTGGGGAC ATGATTTTTA 1080 

TTGTTGATTT ATAGAATGTT 114 0 

ATTAATAAAG AAAAAGCAGG 1200 

TGTCCCAATC CCTACTATAT 1260 

GCTTATAACA AGTAAACCGA 1320 

ACTTgCTGTA ATCACTTGAT 13 80 

AATT 1424 
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ACGTCCTCTA TCATCAGATA CACTTGTkAT CACTTGAGTT GGkTTATGGA AkAAAATGkA 



540 



AAtTTTGTCT TCTAGTTCTA TTTTAATACC TTCAACTTCA ATCGTATCTG ATGGCTTCAC 



600 



s 



TTTTGTTCCT AATTCAGTGA CAGTCGTACC ATTCACTTTC ACTTTTCCTT CAGAAATTAA 



660 



AGTTTCTGCC TTACGT 



676 



(2) INFORMATION FOR SEQ ID NO: 568: 



w 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
1S (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 
20 GAAACGGTTC TACCAAAAAA CAGTAAGGGC TAAACCCAAT CATGGTAAGA CAAAAAGTAC 6 0 

AAATAGCCAT GCCCAAGTTG AACTCGCTGT ACGCCTATTT CTTTCTAAAA AGATAATAAT 120 

AAAAGCCAAT ACTAAATTAA TGATGAATCC AATGGCTAAA ATAATAGTAA ATAACGTTCC 180 

25 

TAAATCGTTT GAAAATGTAA ATCGCATAGT CTTTTCTCCT ATAAAGAAAG GCACAAAAAA 24 0 

ACATTTTGCA CCTTTCACGT CATATTATTT ATTCACAGAT AAAGTTAAAA TTGcATTGAA 300 

^ TTCTTCTTCA TTATTTGGGA ATGTTCTTTC TTCTATTTCT TTAATAGTAA TATTTACTAA 3 60 

TTTTAAATTT GTAGCTTCTT CAGAACTTAA AAAAGCATTA ATGTTTTTTT CTAATAACTC 4 20 

kAAAGTCTCA GCTGt AAAaG TTTTAAGTTT AATT 454 

35 (2) INFORMATION FOR SEQ ID NO: 56 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 94 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 

45 

ACGATATTAC CACCCTCTGA AATAACATCA ATCATGTGAT CTGTAAAGTC CCAAGGATGA 60 

CTTGTTGTGA AACGAACTCT TGGAATCGCT ATTTTAGAAA TTG CTTGTAA AAGATCTCCT 12 0 

so AAGTCATATT CTATATC CTG TAAATCTTTA C CAT AAGAAT TTACATTTTG ACCTAAAAGC 180 

GTTATTTCTT TGTAACCTTC ACGAGCAAGT T CACGTACTT CAT CTATAAT GTCTTCAGGT 24 0 

CTACGGCTTC GTTCTTTACC TCTTG TAAAT GGAACAATAC AATATGTACA AAACTTATCA 3 00 



55 
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TCAATAACGT CTCCTTCTTT AGACCATACT TCAACAACCA TTGCTTTAGA TAAGTATGCT 420 

TCTTCTAAAA TTTCTGGTAA ATGATGAATA TTATGTG TAC CAAATATCAT AT CTACATTT 4 80 

TGATACGATT TTAAAATTTT ATTCACTACT GACTCTT CTT GTGACATACA ACCACAAACA 54 0 

CCGATTAAAA TATCAGGTCG TTCTTTTTTC AAATrCTTCA AATTACCTAT TTCACTAAAC 600 

ACTTTGTTCT CGGCATTTTC TCTAATCGCA CATGTATTAA TTAAAATAAC ATCTGCAGTG 66 0 

TTAATATCAG TCGTkGCTTG a TAGCCTAAT GCyTCmAGTA TACCAGCAAT GACCTCAGTG 720 

TCATGTGCAT TCATTTGACA TCCATATGTT TTAATTAAAA ATGTACGCTC GTTCCCCATA 780 

CCGCGATATT TTyCATCAAT TtGGgAAATC nCTATTATAA CGAACTTCTk GTtTACCnCC 84 0 

TTTTTTnCGC TCCTTTAAAA TTAAGGCGGC TGATAAACAG GTCCAAAATA TTAC 8 94 
(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

TATCAATCCC A GAG CACATG CTGAACAAGA TCAAACATGG GAGAAGATTA AAGAACGCGG 60 

TGAACTTAGA GTGGGTCTTT CTGCAGATTA TGCACCAATG GAATTTGAGC ATACAGTTAA 120 

TGGTAAGACT GAGTATGCAG GTGTAGATAT TGATTTAGCT AAAAAAATTG CGAAAGATAA 180 

TAATTTAAAA TTAAAAATCG TCAATATGTC ATTTGATAGT TTGTTAGGAG CTCTTAAAAC 24 0 

TGGAAAAATT GATATTATTA TTTCCGGAAT GACTTCAACG CCTGAACGTA AGAAGCAAGT 3 00 

TGATTTTTCA GATTCATATA TGATGACTAA AAATATCATG CTTGTAAAGA AAGATAAAGT 36 0 

TAATGAATAT AAAGATATCm AAGACTTTAA TAATAAAanA GTnGGGG CAC AAAGGGACTG 420 

AACCAGAAAA AATCGCTCAA C 441 
(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1205 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS- double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 

55 
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AAGTCGTTCA TCTTTTTCTA CAAGGGTGTA AAAATAATCA ATCATATCGT ATAACGCTTC 120 

TTTAC CAATG ATGTCATATG GTGTTGTTGT CATTTAATCA CCCATTTTCA AAAATTTACT 180 

GTTACGAACT TAAGTTAATA TATAACTAAT ATAACATGAT TTTAAACATT TGAAAGAAAT 24 0 

ATGCATATTT GCCAATTTAA TTTATATTGT TTGAAAGTGT TTCTTTTTTC TTGAAAAAAC 3 00 

GTTGAACTTT ATTTAAAGGT tGATGATGTT CGAGGTTTAG TTCGTTtAAT AAAGATtGGA 36 0 

ACTTTTGTAA ACCTTGATTA TAGTCTTTAA CTTCGAACTC TAACTCATAA TCCGTAGTAT 42 0 

CGAAATACTC ACTTTTATCT AAAACCAGTA AATCACCTTT ATATTT AG TT TCTTGGCGAT 4 80 

ATGTCGTTAA TGCACCAAGT ATTGATAAAG TTGTATCTTT TACACCAAAC TGTTCAACTA 54 0 

TAATTTGACG AATGTCATCT GGAAGATTGT CGTTTGAAAT AATCAAGTTC ATCTCTGGTT 600 

TAATGTCGAC GATATAGTTG TATTCTAATA G AC CAAC CTT TGCTGGTGTC TTTAAAGTCA 66 0 

TTTCATATTG ATTGTCTTTA ACT CTT ATGC GTAGTGCAGA GCGATGTTCC TTTAATTTGA 72 0 

AATCGGGTGT ATCAATATAG TAATTGACTT GCTTAmAAAG CACACTGTCT TTAAAATATT 780 

TCTCTTGCAA TTTATTATAG ATTGAtGCAG TTATCATTTG TtTAAATTCT ATtTCATGAT 84 0 

TTGTTGCCAT GATATGTATA CACCTCGTAT CAAATTCAAT TTATCTTAAC TATATTATGA 9 00 

ATGACAAAGT TGAATTTTAA AAGTAATTTC CTTTATCTAT TATCAATGTT AATTTGACCA 96 0 

TTAAAAATAG TGTTCGTAAG TGTTTTGTAT TATTGaATTG TGTTAAAATG TTATGGAATA 10 20 

AGAGGAGGAT TAAGCATGsG TTTwTATATT AATGAAATTA AAATTAAAGA TGACATACTT 1080 

TATT GTTATA CAGAAGATTC TATTAAAGGA TTATCTGAAG TAGGACAAAT GCTCGTTGAT 1140 

AGTGATAATT ATGCCTTTGC GTATACATTA GATGATGGTA AAGCGTATGC TTATCTCATT 12 00 

TTCGT 12 05 
(2) INFORMATION FOR SEQ ID NO: 572: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 
so TGAAGAAATA GCAATGATGA AATGC CGTAT GTTATATGAG ACGGGGTCAT TTCTTGAATT 60 
AAGAGAAGAA ACAATTGTCT TATTGAAAAC TGGCATACAA CAATATGATG CATTGATGAT 120 
TTATTACGTG AAAAGTTTGA TTGGTTTGGG ACAATATTTT GAAGCGGTAG AAGTAATTCA 180 
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ATTTGCTAAG TCAAAATTAA TTGAAGATGA AAAACGATTG ACTCAGTCAT TAGCTGATTT 3 00 

TGrTACGTTA TCAATGAGGG AACAGACGCA CTTGATTTTG AAGTTAATAG ACAATGGTCA 3 60 

TTTTCAATTT CAAGAAACGG TATTATATAT ATkAAaAyCT AATaCGTACa GTtATAACCT 42 0 

CATTAGTTTA ATGATTGAGT ATTTAAGGTT CGCAAATTGT ACACAAGAAC TGACAATTGA 480 

AAAGTATGGT ATGGATGTAA CTTTTGTACC AGCTAATTTA AAAGGGCTAG AACATACAAC 54 0 

ACTTAAAGAA AAAGTTATAC CTAACGTTAT 5 70 



(2) INFORMATION FOR SEQ ID NO: 573: 

;5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 939 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 

GTTGAATGGT TAGCAGCTGC AGTTGTATTA TATTTCTGTG GTGTAATTGT TGACGCTCAT 60 

GTATCATTCA TGTCCTTTAT TG CAATATTT ATCATTGCTG CATTATCAGG TTTAGTCAGC 120 

TTTATTCCTG GTGGTTTCGG CGCTTTCGAT TTAGTTG TAT TACTAGGATT TAAAACTTTA 180 

30 GGTGTCCCTG AGGAAAAAGT ATTATTAATG CTACTTCTAT ATCGTTTTGC GTACTATTTT 24 0 

GTACCGGTAA TTATTGCATT AATTTTATCA TCATTTGAAT TTGGTACATC AGCTAAGAAG 3 00 

TACATTGAGG GATCTAAATA CTTTATTCCT GCTAAAGATG TTACGTCATT TTTAATGTCT 360 

35 TATCAAAAGG ATATTATTGC TAAAATTCCA TCATTATCAT TAGCAATTTT AGTATTCTTT 4 20 

ACAAGTATGA TCTTTTTTGT AAATAACTTA ACGATTGTkT ACGATGCTTT tATATGATGG 4 80 

AAATCACTTA ACGTATTATA TTCTATtGGC AATTCATACT AGTGCTTGTT TATTACTTTT 54 0 

ACTGAATGTA GTTGGTATTT AT AAG CAAAG TAGACGTGCC ATTATCTTTG CTATGATTTC 6 00 

AATTTTATTA ATCACAGTGG CGACATTCTT CACTTACGCT TCATATATTT TAATAACATG 660 

4s GTTAGCTATT ATTTTTGTTC TGCTTATTGT AGCTTTCCGT AGAGCGAATA GGTTGAAACG 720 

CCCAGTAAGA ATGAGAAATA TAGTTGCAAT GCTTTTATTC AGTTTATTTA TTTTATATGT 78 0 

TAACCATATA TTTaTTGCTG GAACGTTATA TGCATTAGAT ATTTATACGA TTGAAATGCA 84 0 

50 TACATCTGTA TTG CGCTATT ACTTCTGGCT TACGATTTTA ATCATCGCTA TCATCATAGG 900 

TATGATTGCA TGGTTGTTTG ATTATCAATT TAGCAAAGT 939 
(2) INFORMATION FOR SEQ ID NO: 574: 
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(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

s 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

w G AATT AATTA AATATTACAC ACAGCCTCAT TTTTCATTTT CAAATAAATG GCTATATCAA 60 

TATGATAATG GAAACATTTA TGTTGAACTT AnGAGATATT CATGGTCAGC ACATATATCT 12 0 

TTATGGGGCG CTGAAaGTyG GGGAAATATT AATCAGTTAA AAGATCGTTA CGTAGATGTG 180 

15 TTTGGACTAA AAGACAAAGA TACTGATCAG TTATGGTGGT CTTATAGAGA GACATTTACA 24 0 

GGTGGCGTTA CACCAGCCGC AAAACCTTCT GATAAAACTT ATAATCTTTT TGTGCAATAC 3 00 

AAAGATAAAC TACAAACGAT TATTGGTGCG CATAAAATAT ACCAAGGCAA TAAACCAGTA 36 0 

20 

TTAACATTGA AAGAAATCGA TTTCCGTGCA CGAGAAGCGT TAATAAAAAA TAAAATATTA 420 

TATAACGAAA ATCG TAATAA AGGTAAGCTT AAGATCACCG GTGGCGGTAA TAACTACACT 4 80 

ATTGATTTAA GCAAAAGATT ACATTCAGAT CTAGCAAATG TTTATGTTAA AAATCCTAAT 54 0 

25 

AAAATAACTG TTGACGTCCT CTTTGATTAG TATATGAAGG TGACTTATAC TTCATGCACT 600 

TTAATTCCAA ATCAGATTAT TTAAATGATA ATTTTTAAAG TGTATGATGT ATATAATAGG 66 0 

30 TAAAATTTTC TATATATTTA AATGGAATTG GGAGTAGGAA TGTGACAGAA ATAGTATTTT 720 

ATAAAATTTA TTCtTGTCAC TCCCCAACTT GCACATTATT GTAAGCTGAC TTTCCGCCAG 780 

CTTCTATGTT GGGGCCCCGC CAACTTGCAT TGTCTGTAGA aTTTCTTTTT GAAATTCTCT 84 0 

35 ATGTTGGGGC CCCGCCTATA ATTGAAAAAT GCTTGTTACA TGGGCATTTT CATTCGGTCA 900 

ACTACTACCA ATATAATATT GtAGaGCCTA AGACATTGAT TTATTATGTC TTAGGCTCTA 960 

TTCCTTCATT TAATGATTAA nTTATTATAG CAATACTTTA TTGTCCCATG ATTAGTGTTC 1020 

TTTTAATG AG ACATAGTAAC TATAAAGTTT AATAATCGT 105 9 
(2) INFORMATION FOR SEQ ID NO: 575: 

4S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 
GTTTGCTTTA GGTCCTGTTT CATATTTATA CTTCGAAGGA TTTACCTTTT TGaAGTCTGG 6 0 
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TTCAACTkGC TTTTTATTCT TTTCGAAATC AGCTGGTTGA GTAGTTATGA GTTCATTATT 180 

TTTATTAGmA TAAATCTTAC CATTAACATA TTTATAATCT TTTGTTATAA AGTCACCATT 24 0 

TCTGAATGGA ACTACTTGAT TATGACCTTT AGAGAATAAA TCAGTACCGA ACATTAAATA 3 00 

GTTCTTCGTA TCTATACCAG CCAAATGTAA AATTGTTGGC ATTACATCGA CTTGACCAGC 3 60 

ATATTCATTA TTGATACCAC CAGATTTACC AGGGATTTTA ATCCAGAAAC CAGTTCTGTT 4 20 

TAAATCTGTA AATTTAGCCG GTGTGATTTT TTCACCTAAT AGTTTTTCCA TGGCATTGTT 480 

ATGGTTTTCA GAGATACCAT AGTGGTCACC ATAAATCATA ATCACTGAAT TGTCATATAA 54 0 

TCCTTTTTTC TTCAAGTCAT TAATATATTC TTCT 574 
(2) INFORMATION FOR SEQ ID NO: 576: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 

CAATGTTTTA TAGTACAATA TATTTTnAAT AATACTC GTT AAGGAGAATG ATATGATATC 60 

AATTCACGCA ATTTCGACAG GAAAAATCCA AGATTTGCCT TATAGCTCGA AAAGACCGAT 120 

GCGCTCTGCT TTAGATAAAA CTAAGATTTC ACAAACAACA TGGTTATCTT CAACTGGTTT 18 0 

CACTGGTGAT GAACAGGCTT ATAAAGATCA TGGTGGACCA CATAAAGCAG TTTGTGGGTT 24 0 

TAGTAAGCAT AATTATGCAC TGTATCAAGA TGATTTACCT ACACTACCTA CTCATGCGAT 3 00 

GTTTGGAGAG AATTTAACAT TTGATTATTT AGACGAATCT GATGTTTACT TTGGTAATCA 36 0 

ATATCGTTTA GGTGAAGCGT TAATTGAGGT TTCTGAAATT AGAGAACCAT ACTGGAAAAT 42 0 

TCAAGCAAAA TATAAT ATT C CTGATTTAGT GAAGCGCATG TCTACATCTG GTAAAACAGG 4 80 

TTTCTATTTC CGGGTATTAA AACAAGGCTA TGTATCTCCA AATGATCAGC TTTACTTAAT 54 0 

ACAAGAAGCA CCAATCGAAC ATCGTTTATC TGTACAACAG CTTAATGACC TTTATTATAA 6 00 

TGATAGACAA AATCAAGaTA TGTTACGATA TGCACTAAAC AATCCATTTC TGTCACCAAC 6 60 

AAGACGCGAT AAACTTCAAA AAATGTATAA CAGAACATTG GAAATAATTA CCTTTCATTn 72 0 

ATAAGTGTTA AATGAACTTT TCAAAACAnA AAGGAATCAA CTTCACACAT CGTTTGTATG 78 0 

AATAGTCTTA TCTATA 7 96 
(2) INFORMATION FOR SEQ ID NO: 577: 
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(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

s 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 

w AGAAATTATG ACAAAATATA AAGATGGAAA GTTAGTTTAT GCATCAGTCG AACCAGGATC 6 0 

TTACGTAATA CATAAAGATG ATGCAATTAA ATATGACGAT TATTCTAAGT TAAAAAAATT 12 0 

AAGTCAGCTA ACTAAACTTG ATCATCCAAA ACCAGTTCCA TATAGCGTaC TCAAATCAAA 180 

, 5 TCTTTCGGAG TACCTTTAAC AAGCGTTTCA TTTATGACAC ATGGATCAAA GGATACTAAA 24 0 

GATGAAGTGT TGCCGGCATT GGCCTATTTC ACTTTTTCAC CAAAAAATTA TGAAGACAAG 300 
TCTAATCCAG ATCCAAAAGT TTTAAATTTA GTACATA TGG ATTTCTTAAA TGCATCTAGT 360 

20 

GATTTTGGTA ACGCACATTT TGTTGTTTTA AGTAAATATA TTAAAGAGTA TGAATCAAAC 42 0 

TATGAAACAG CGTCAGATGA TTCTTTAAAA TAGTATTTAC TGTGTGAAAA ATAAATAGTG 4 80 

TACTACATTA AATAATCGCA ATAATAATCC CGATAAACAA TCAGCATTAC TGCTTATCAC 54 0 

25 

ATAGAGTTCG TAATAACTAT AACTCTATGA TTCGCAAATA ATAAATGATT GTCATCGGGA 6 00 

TTTATTTTTA TCAATTTATA AAGTGACATT ACCTTGTTCA TCAGCAGGTT TGAAAACAGT 66 0 

30 AATCACTGCA CTAATAATTG CTAAAATGTG TGGGATACCT GTC CAACAG A ATATTAAGTG 72 0 

TAGAATACCT TGCATATTCT TGCCGGCATA AAATTTATGA ATACCAAAAC TACCTAAGAA 780 
CAATGCTAAT AAAATATAAA TAACTTTGTT TACTTGCATT TCTTTCCCTC CAGTTGAATT 84 0 

35 GCTTATAATG ACATTAGCTT CTCTTTTTAT TATACCCACT TTTAGTTCAA ACATTCTAGT 900 
TTAAGCATTC CCAATCATCT AAATTTCAGT TATTCAATCC TTACAATAAA TTTAGGATTA 96 0 

CATTTCAGTT GCATTGTATT ATTTTACGTG TGAAATATAC GTAATGAATC ACATGACAAy 102 0 

40 

CTyCAAATTG AAAAATATAC ATTCTATGAT GTAAGGTCGC ATTTTTAATA TATTTACGTn 10 8 0 

AAAATAGTTT GGATG 1095 
4s <2) INFORMATION FOR SEQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
so (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 
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ACTGCCAAAA TAATCATAGC CATGTTCTAC AG CTG CTTTC GCTACAATAT CCAAACGCAT 120 

TTCAAAACAA GCGGTACAAC GTAAGCCGCC TTCTTTTTCA TCAGCTAATT CTTTATCCTT 180 

CACCATTTTC ACAAACTTAT GTGGTTCATA AGGTGCTTCA AT ATACTT CA CATTCGCACC 240 

AGTCTTGCGA TTAAAATCTT CCACAAATTG TTCTTGCACT TTAGCACGTC GTAAGTACTC 30 0 

ATTTTTCGGA TGAATATTTG AATTCGCGAA ATAAATTGCA ATGTCTG CAT ATTGTGTTAA 360 

AAACTCTAAT GTATATGTAC TACAAGGTGC ACAACAACTA TGCAATAAGa TTTTAGGTCT 420 

GATTGCTTCT CTTTCCCACT GsCCGATTAA TnTCTTCAAC ACCTTGTCCA TAATTAATTT 480 

GTnGATTTn 489 
(2) INFORMATION FOR SEQ ID NO: 579: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

CGTACTGCAT CATTTGTTGA AAAATTTGCA AACTTCTTCA AGATGTTATG GCTTAGACTT 60 

AAAGCGATGA AGCACTACAA AGCCTTAAAT AAAGAATCTA AGAAGCAAGA ATTTGAAAAT 12 0 

TCATTCAAAG ATGTTCAAAA AATTATGCGT ATTGTGAATC ACAATATTAT TTTACGCTTA 180 

AAAGAAGAAC AAAATAGTAC AAATGTACTT GAGGTTAGCT TAGTCATTAA TCATTACTAT 24 0 

GATATGAGTC GCTCATTAAA GTGGCGTGCA CAACGTCGAA AAGAACGTCA AGAAAACAGC 300 

AATCAAATCA TACCGCAAGC TATGTT CCAT AACCACAAAT TGGAAGCATT GTACTTACAA 360 

CGTCATCTTT TAGATGAATT AATTCGCAAA AATAAAATCA ACAATATCGT TGCAGCTCAA 42 0 

ATT C GAG AAA ATATCAATTa CAACGAAATT GTCTTGTCTT TACAGTCCAA ACATTaAGCA 480 

AGaCwTaCaw TmCCCCCGTA CATATGAGAC AAAGTCATTA TCATCTCATA TGTACGGGGT 54 0 

TTTTATATTC AACATCAAAA AATCAGATTG ATGAAAAGTA AATAACCTTT CATCAATCCG 60 0 

ATTTGATTAT AGAATCTATT TTTTAAGTTT AAATGGAATT GTACATACGT TAACATTCTT 66 0 

TTGATAAATT AAATACAATT TCATACGCAA ACTAGTTTGA TTGTGTAATA AATTATGCCA 720 

ACGTTTCTTA GTAATAAATT CTGGTATCAC TACTGTAATC ATATAG TTTT GATCGTTGGC 780 

TTTACGATTA ATCTTATCGA TAAAACGTGA AATTGGTCGT ATAATACTGC GATATTCCGA 84 0 

ATGTAAAATA ACTAATCTTA CATCTGGGAA ATGACGTTTC CATTTCTCTT GGAATGCTTT 900 
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TGCGTAATAA ATAGACTTAT CAACTGCTGT TGTAATACTT GTTATCGGCA CAATTGCTAA 102 0 

ATTACGATCT ACCACGTCCA CATTAAGAAC ATCAATGTCA GAACGTAATT GTTCTGCGAT 1080 

ATCTCGATAA TGkTTGkTAA TTTTCAAGAA GAAAATCACC ACGAACGGCA AGAAAATAAG 1140 

TATCGGCCAT ACTTGGCTAA ATTTAGTTAT GAGTAAAATC cATAAnAACA ATAAATGTCA 12 00 

CGATACCACC AAGTAAGTTC ACAGACAACT TACTTAACCA ATTCTTAGGA CGTTCATGAA 12 6 0 

TCCATTTAAT AACCATACCG AATTGTG 1287 



(2) INFORMATION FOR SEQ ID NO: 580: 

JS (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 
GCCTCCTTTC CTATATCCAT GGGTGTCCTA GTTGGGAATG GCTTATTCCC TATCCCAAAC 6 0 

25 

ATCTGTCAAT TCAAGACTAT CACTGTATAC TAAATCGCCT nTCTATACnT CCATnCATTT 12 0 

CATTTTCCAG TTGGAACAAT TTGTTTAATT ATTTTAAATA TCATTATCAA TCCAGAAGTA 180 
TTTACTATTC ACTTTTACAA TAATCAATCA TTTAACTACA CATGGGTAGT CGGTGGTTTG 240 

30 

CTTGGGGTTA GCTTTTTAAC TGGCAATTTA TTGTTATTGC CAAAATTAGG TGCAACATTA 300 
ACTGTAATTG CAACAGTTGC GGGTCAAATT ATTATGGGTG TCATTATTGA TACATTTGGA 360 

35 TTATTTGGCG C T ACAATT C A TGATTTTAAT TTAATTAAAG CAATTGGAGT ATTGTTACTC 420 

ATTGTCGGCA TCGTCATAAT GAATCAATTT AACAAGAATA ATTTATTACT AACTGATCAA 480 
AAGTATTTAC TGTTTTGGCT TCTATTAGGA TTTATTTTTG GTTTCTTTCC ACCTATTCAA 540 

40 ACGACAATTA ATAGTGCTTT AGCTAGTCAT ACTCATTCAC CAGCCTTTGC ATCATTAGTA 600 
TCATTTACAA TTGGGTCAAT AGCGCTATTG ATTTTAACCG CTATTTTTAA TCGTTCTTTA 660 
AAACTAAAAA CAAGTCATTT AAAATTCGGT AAATTAAAGC CTATCTATTT TACTGGCGGT 720 

45 

ATACTTGGCA TGGCTTTTGT AACAGCTAAC ATTATCTTAA TGCCTCATAT GGGTGCAGCA 780 
TTAACAACAC TTATTGGGAT GTTTGGCCAG ATTCTAATGG G CAT ATT GAT AGATCACTTT 84 0 

50 GGATTATTTG GTTCACCTAA AATAGCAATG ACATCCAGAA AAACTATTGG TCTATTATGT 900 

ATTTTGACAG GCATTATACT TTTAAGATTA TTTTAAATTA ACTTTTAGCT TATCATTTTA 960 
ACTTGTAATT ATTTTTAAAA GTGATAAGCT ATTTTTTTGT GGTCTAAAAA TCTTTAGAAA 1020 
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CAACTCATTC TTAAGACCTA AATTAATGTT ATnTTTTAAT AATTTACACC AAATTAATAG 1140 
CAAAAATTAT GTTATTCGTG CT AATATTT C AT AG TTGGTT ATTCAATTAA TTAAAAATAA 12 00 

GTCAAAATGC ACAACTTTTT ATn 1223 
(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 

CACCTTTTTA TCATGCTTAG TTATCAATAA ATCTATATTT TGCTGTTTTA CAATTTTTTT 60 

AACTTTATCA ATCTCATTAT CTTGGACTAA ATAAATATAT GATCTTGCAT CTGTTGCTAG 120 

AGCTTGTTCG TGTTTTTCTG ATAAAACATA TGTGATGGAA GCGTGAATAA TAATGCCTAA 180 

TGTAACAAAA CTGATAATTA AT ATACTG CT TATCAATAAC ATTAAGCGGT GGTGAAACTT 24 0 

CATCATTGTT CTTTAGGT CT TT CCAATTT A TAG CCTAAG C CACGCACAGT TTTAATAAGT 3 00 

TGTGGCTTCT TAGGATTATC TTCTAATTTA TCTCTTAAAT GACTGATATG TACATCAACA 3 60 

ATTCTTGAGT CTCCTGCAAA TTCATAATTC CATACCGTAT TTAACATATG CTCTCTCGTA 420 

ATGACTCTGC CTTGTCTTTC TATCAAATAA AGCA 4 54 
(2) INFORMATION FOR SEQ ID NO: 5 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 

TTTATAAGAT TTTATTTACA AATAATTGGT TTTCATATGT ATAAACACTT TTGACTTTCA 60 

AATCTTAACG ATAATTCTAT TACAATACAA TCCCCTATTA GAATGATTTA TGTAATAAAA 120 

AAAGCGGAGT TTCCCCCAGC TTTTCTAAAC GACTACATAA AATATAAGAT TGCAATTAAA 180 

TGCAATAGTG ATGCTATTAC AATAAAAATA TGCCAAATCA TATGAAAATA TGGTCTATTC 24 0 

TTTTGTG CAT AAAACCATGC ACCAATTGTA TAAGACACAC CACCTAAGAA AATGAATAAT 3 00 

ATGAATATCC ATGATGTGCG AATAAAAATA ATTGGTAACA AGATAATACC TACCCAGCCC 36 0 
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AAAATCCCCC AAAGTGTCGT TCCCCATAAT AA 4 52 
(2) INFORMATION FOR SEQ ID NO: 583: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1472 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 583: 

CGCTTTTTGC AATAAGTCAT TAGCCGCTTT TAAGCCTTCT TCTTTTCGAT CTACAACAAG 6 0 

TAAAATAAAT GGCTTTAACG CTTCTTCTTT TTCACTTTCA AG CATAT CTG GTTTTTGAAC 120 

CATTTCAAAT GGAGATTTCA ATCCATTATT ATCGCTCATT TCAATAATTG CATCATACTG 18 0 

20 TGCTTGTGAC ATACTTGCAA TAGCCTGTTT TGCATTTTCT TGAAGGAAAT ATAAGTTTTT 24 0 

CAATTTAGGA TGCTTATTTA ATGTACTTAA TGTAATCGGT GTAATGTCTT TCTCATAAGA 3 00 

CACTTCAATC ACTGTACTAT TTGTTCTACC AGGAATTGGT GGTTTTT CAT GAATATGCTT 3 60 

25 TGATACTTCT CCAATTCCAA CGACAGATTG ATTTTTCGTT CGATTATAAA AAATAATATT 420 

GTCGCCTTCT TCTAACTGAG TATAAAAATG ATAACCATTA CGTTTAATAC CGTTGTACGT 4 80 

GTG CGTATAA ATCGTATATT GGTTTCCAGG TTCAAATTCT TCAGTTTCAG CTAAAAAGAA 54 0 

ATAACG CGGT ATCTTAATTT CGCCTTTACC AAGAC CACTT ATTAAATCAA ACTCTTCTGC 6 00 

AGTGATTTGA TTGAACAATG TCTCTTTCAT ATnACTTATA CGAAATTCCA AAGCTTCACT 660 

3£ ACGCTTTAAA TAATCTGCTG TTAATGGTTT CAATTGTTCA TTAAAACGAA ACTGTACACG 72 0 

TATTTTATTT TGTGCACCTG TTTCAACACT AATAATTTCA CCACATCCAA GTAGTCCAGT 78 0 

ATCCGTCTGA ACTTGATAAA AGATGACTTG ATCTCCTACT TTAGCCTTTT TAAACGCTCT 84 0 

40 AAATCCTTGA GATGG GTTAA AATGTGCGCC TGATTCAAAT AAAGCTGTTT GTCCTACTAA 90 0 

CGGTTCATTA TGATTCCAAC GGTTATATCC ACAATTCAAC CAAAAATAAT TCGTTTCTGC 96 0 

TGTCATCTTA ATACTCCTTA ACCTGAATAA ATTTTAGAAA CACTATGAAT TACATTCTTT 102 0 

45 tAGTGTTTCT TATGCAGTTG GACGCGTATG CGAACAACTG TATACCCTTT GTTCACTGCG 10 80 

ATTTTAATCG CATTTCCTAT AACATTGTAG CGCCCAGGAC ATTAATTTAC GTCCCAGACC 114 0 

CTTATCGTTT TCACTTCTAA GTAAGTCGAA CTATTTTGCT TTACAACAAG TGCGACTCTA 12 00 

50 

AATACAGTTG GACACACATA CGAGCAACTG TATACCTTTT AATCAGTTTT CTATATTTTA 126 0 

TTTATTATAT CTGTCTTAAT GATAAAAATT GTTACAAACA GTTTAACATA TTTAGCTACC 132 0 
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ATGATACCAC TATGCTTGCn TATCTCTATA GCGC CATTGA TACACATTTT TAAATATCTA 14 4 0 

TACTGCCGTT AGAATTTTAT CATGTCTnAA TT 14 7 2 
(2) INFORMATION FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

GGTAGTGAAT GGGGTTCAAG ACAACAGTAT TGGAAGTACG AATGAATCAC AGTTTTTAGG 60 

AAATTATATT GTAATTAAGC ACGCAGAAAA TGAGTATAGC TTAATAGCTC ATTTACACCA 12 0 

ATATTCAATC ATTGTGAATG AGGGGCAAAA TGTTAAATAT GGTGATATCA TTGGGAAGGT 18 0 

TGGGAATTCT GGCAATTCTA CTGAACCTCA TATACATTTT CAAGTAATGA ATGATAAGAA 24 0 

TATTGAAGCA TGTACATCTT TAAAAATTCG ATTTATAAAT AATCGAGAAC TTATCAAAGG 3 00 

GGATGTGGTC TGCGGATTAC AAGCTGAATG ATGGCGATAC TTAT AAAAT C TCGACACTAT 360 

AAAAATGGTA TAGTGTCGAG ATTTTCTTGC TTATTTAGTT AATTCAAAGT GCACGCCGGA 420 

TTCATTAGAA GTCGACGTAT TTTTGTTTGT AATAGAGTAA CCGGTCATTG AAATTTTAGA 4 80 

TTCAATATCT G AAGCG GAAT TTGTAGATTC AGGATTATAG AAGCTACATT CATAAGTGTT 54 0 

ATCAT CTTTC TTTTTAAGTA TAAACATACC TTTGGCTTTA ACTTCGACTT TAGTGTTGTT 6 00 

AATGTCAAAA GTTTGAGTGC TATTACTATA ATTAACACCA GCCCAAACCG ATTCATTATC 6 60 

TTTCACAACG GGGAAGTCAT CTTCTTGCTT AACGACGTnA CTTCATCTTT CTCTGTCTTA 72 0 

AAG A CAT CTT TAGATAAGCC TGGATACAAC ACATATCCAT ATTTATTGTC AGAATTAGAA 780 

TGCTTTT 7 87 
(2) INFORMATION FOR SEQ ID NO: 585: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585: 
ATG CTAAATT AGGGGGAATA TTTACGATAA AGAGACCAGA AAAATAATAA AACACATG CT 60 
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TAAAATGAAG ACTATTTTTT ATTACAAGAA AATGTATCTA GTAAACTTAA AGTAGCAAGA 180 

CCTAATAAAT TTAATGCATG TTGTGCACCT TTTTTACCTT GGCCAGCTTC GAAATGTTTG 24 0 

TAAGCAGCTA CACTTAAAAT GCCTATCGTT GATAGTGATG CAAGG CG AG A AATG TTTTTA 3 00 

TTGATAAAGC TAGCTGAGTA TAAAGCAGCA GTAGTTGCTT CTGCAATGCC GACGTATTTT 360 

ACAAGTTCTT TTTGCAAGCC AAAAGTATGT TCAAACAGTT CAATCATACC CTTATCTTCT 420 

TGCAATTTAG GTTTACTGGC TTGGTATAGC TCTTTCGCAA GTTTTAAATT CGTTGCGTAA 4 80 

CGCAAAATCA TATTTAATTC CTCCCAATAT TTGATTTTTT GTGAAAGATG ATTACTTTAT 54 0 

CATTTTTACC CGTTTCTATA AAAATGAATC AATTATGTAA CGTATG TGT A GTTTAGGAAT 6 00 

GTTTG CTATG GAAATATAAT TCTGTTCACT CAAAATGTAT GAAATTAATG TGTAGTTTTG 66 0 

TCGAGTTGCT CTTTTAATTT GGTTAGATTG TTTTTTAGAG AAGCGGTACT ATTTTTAAGT 72 0 

GCATCAACAG ATTTACCTTC GTTTTGAGAC ATTGAGTTTA TTACAGCACG AAGTTCTGTT 780 

TCTAGTATGT CaGCGTCGCt TTAGCATTAG AACTTAaTAt TTAt Ac TCTT 83 0 
(2) INFORMATION FOR SEQ ID NO: 5 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 

TTAGGACGTT TTTACAATCA GTACAATGAG CTCATTGTTA TTAGTCCTTT AACGGCGTCT 60 

TTTAATGCTG G CGCTACATT TGGGCGATTT CATCATTTAA TTGATACTGA AACTTTAGCA 120 

AAATTAGAAC ATGAAAAAGG ACATTATTAT CAGAAGATGA TATGTGATGA CAATGTAGAA 180 

ATGATTTCTA TAAATAACAT ACCGAAATAT CCGAGAAATC ATAATGTATT AACTAATCAT 24 0 

GACTCATACG AATATTCATT GAATTTAGGA AGTAGTAATA GTTATTCAAA GTATGAGCTT 3 00 

ACCTTAGATG ATATTTATGT TGGTGCTACC TTTtAACAAA TTATATTTAT ATTCTAGCCm 360 

AC t AAAT AAA AGGGkaCtaT TTGaATCaAA CmATaTGtAT TAACCTTTTT TA 412 
(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 

TTCAGTTTGA AATTAATCAT ATAAATTTCT TATGGGAGGG TTGATATCTT AATGATTAAC 60 

5 ATTATTTCAG CTATAGGATC TATT GGAACA TTTATTATGG CTTTATTTTA TTTTGTATCA 12 0 

GTTTCAGTTC AACTTTATCA AATGAAAATT AGCTTTCTGC CAGCTTTAGG TTTTAACCAA 180 

ATTTTATTAG AAAGGGAGGA GGATCAACTT AATATAATGA ATTCGGCAAC AGAAGAG CAT 24 0 

to 

CATCATAAAG ATTATATTAA ACTATATAAT TTAGGTGGCG GTGCTGCTAA rAAAATTGCA 3 00 

ATAGAGGTTT TATTGGGGAA GGATArAGTC ATTCAGAAAA AATACGTGCA TATTkTACCT 36 0 

is AGTAAAGAAG GGTACATGTT AC CAATTAAT AAAAATGTGT ACGAAGAATT AGAAAGAACG 420 

ATTGAsAACA ATGGTCATGA AGCTGATTTG AATGTACGTA TGACTTATTA TCATAATGTA 4 80 

AGTCGCAAAC AACAGGAAGT TATATTAAAA GGTCAAATCG ACCGTTTTAA TACTTATAAT 54 0 

20 AATAAAGAAA TTTATGATTT GCAGTTTATC TAAAAATTGA TTTAAGAGGG TAGTTGTTTA 600 

TTGCGAAAAA TATCATTCAA TTTTAATGAA ATAATGGCGT CATTACTATA AAATATTACT 660 

TTATGTTGTA ATGCATTTTT CTATAAGATA GAACTAAAAG GAGGGGCAAA GATGCAAATT 720 

25 AGACAAATAC ATCAACATGA CTTTGCTCAA GTGGACCAGT TAATTAGAAC GGCATTTGAA 780 

AATAGTGAAC ATGGTTATGG TAATGAATCA GAGCTAGTAG ACCAAATTCG TCTAAGTGAT 84 0 

ACGTATGACA ATACCTTAGA ATTAGTAGCT GTTCTTCAAA ATGAAGTTGT AGGGCACGGT 900 

30 

TTACTAAGTG AAGTTTATCT TGATAACGAG GCACAACGGG AAATTGGATT AGTGTT AG CA 96 0 

CCTGTATCTG TTGATATTCA TCATCAAAAT AAAGGTATTG GGAAGCGATT GATTCAAGCA 102 0 

3S TTAGAACGAG AAGCAATATT AAAAGGATAT AATTTTATCA GTGTATTAGG ATGGCCGACG 1080 

TATTATGCCA ATCTAGGATA TCAACGCGCA AGTATGTACG ACATTTATCC ACCATATGAT 114 0 

GGTATACCAG ACGAAGCGTT TTTAATTAAA GAATTAAAAG TGAACAGTTT AGCGGGAAAA 1200 

40 ACAGGTACCA TAAATTACAC ATCTGCTTTT GAAAAAATAT GATTTCAAGC TAGGATTACA 126 0 

TTAGGTAGAG TTCATATTAA TAATAAAAAA TGTTTGCAAT CAAATCGTAC GTTGTCGTTT 13 20 

GTAATTCTTA AAATAGCAAT AAATAAAATG TTTGTTAGTA AAGTATTATT GTGGATAATA 13 8 0 

45 

AAATATCGAT ACAAATTAAT TGCTATAATG CAaTTTTAGT GTATAATT CC ATTGACAGAG 144 0 

ATTAAATATA TCTTTAAAGG GTATATAGTT AATATAAAAT GACTTTTTAA AAAGAGGGAA 15 00 

TAAAATGAAT ATGAAGAAAA AAGAAAAACA CGCAATTCGG AAAAAATCGA TTGGCGTGGC 1560 

so 

TTCAGTGCTT GTAGGTACGT TAATCGGTTT TGGACTACTC AG CAGTAAAG AAGCAGATGC 162 0 

AAGTGAAAAT AGTGTT ACGC AATCTGATAG CGCAAGTAAC GAAAGCAAAA GTAATGATTC 16 80 

55 
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GTCAAACACT AATAATGGCG AAACGAGTGT GGCGCAAAAT CCAGCACAAC AGGAAACGAC 1800 

ACAATCATCA TCAACAAATG CAACTACGGA AGAAACGCCG GTAACTGGTG AAGCTACTAC 186 0 

s 

TACGACAACG AATCAAGCTA ATACAC CGGC AACAACTCAA TCAAG CAATA CAAATGCGGA 1920 

GGAATTAGTG AATCAAACAA GTAATGAAAC GACTTCTAAT GATACTAATA CAGTATCATC 1980 

TGTAAATTCA CCTCAAAATT CTACAAATGC GGAAAATGTT TCAACAACGC AAGATACTTC 204 0 

W 

AACTGAAGCA ACACCTTCAA ACAATGAATC AGCTCCACAG AGTACAGATG CAAGTAATAA 2100 

AGATGTAGTT AATCAAG CGG TTAATACAAG TGCGCCTAGA ATGAGAGCAT TTAGTTTAGC 2160 

,5 GGCAGTAGCT GCAGATGCAC CGGTAGCTGG CACAGATATT ACGAATCAGT TGACGAATGT 2220 

GACAGTTGGT ATTGACTCTG GTACGACTGT GTATCCGCAC CAAGCAGGTT ATGTCAAACT 228 0 

GAATTATGGT TTTTCAGTGC CTAATTCTGC TGTTAAAGGT GACACATTCA AAATAACTGT 234 0 

20 ACCTAAAGAA TTAAACTTAA ATGGTGTAAC TTCAACTGCT AAAGTGCCAC CAATTATGGC 240 0 

TGGAGATCAA GTATTGGCAA ATGGTGTAAT CGATAGTGAT GGTAATGTTA TTTATACATT 24 60 

TACAGACTAT GTAAATACTA AAGATGATGT AAAAGCAACT TTGACCATGC CCGCTTATAT 2520 

25 

TGACCCTGAA AATGTTAAAA AGACAGGTAA TGTGACATTG GCTACTGGCA TAGGTAGTAC 2580 

AACAGCAAAC AAAACAGTAT TAGTAGATTA TGAAAAATAT GGTAAGTTTT ATAACTTATC 2 64 0 

TATTAAAGGT ACAATTGACC AAATCGATAA AACAAATAAT ACGTATCGTC AGACAATTTA 2 700 

30 

TGTCAATCCA AGTGGAGATA ACGTTATTGC GCCGGTTTTA ACAGGTAATT TAAAACCAAA 2760 

TACGGATAGT AATGCATTAA TAGATCAGCA AAATACAAGT ATTAAAGTAT ATAAAGTAGA 2 820 

35 TAATGCAGCT GATTTATCTG AAAGTTACTT TGTGAATCCA GAAAACTTTG AGGATGTCAC 2880 

TAATAGTGTG AATATTACAT TCCCAAATCC AAAT CAAT AT AAAGTAGAGT TTAATACGCC 294 0 

TGATGATCAA ATTACAACAC CGTATATAGT AGTTGTTAAT GGTCATATTG ATCCGAATAG 3 000 

40 CAAAGGTGAT TTAGCTTTAC GTTCAACTTT ATATGGGTAT AACTCGAATA TAATTTGGCG 3 060 

CTCTATGTCA TGGGACAACG AAGTAGCATT TAATAACGGA TCAGGTTCTG GTGACGGTAT 3120 

CGATAAACCA GTTGTTCCTG AACAACCTGA TGAG CCTGGT GAAATTGAAC CAATTCCAGA 3180 

45 

GGATTCAGAT TCTGACCCAG GTTCAGATTC TGGCAGCGAT TCTAATTCAG ATAG CGGTTC 3240 

AGATTCGGGT AGTGATTCTA CATCAGATAG TGGTTCAGAT TCAGCGAGTG ATTCAGATTC 33 00 

so AGCAAGTGAT TCAGACTCAG CGAGTGATTC AGATTCAGCA AGCGATTCCG ACTCAGCGAG 3360 

CGATTCCGAC TCAGACAATG ACTCGGATTC AGATAGCGAT TCTGACTCAG ACAGTGACTC 342 0 

AGATTCCGAC AGTGACTCAG ATTCAGATAG CGATTCTGAC TCAGACAGTG ACTCGGATTC 34 80 

55 
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CGATTCTGAC TCCGACAGTG ATTCCGACTC AGACAGCGAT TCAGATTCCG ACAGTGATTC 3 600 

CGACTCAGAT AGCGATTCCG ACT CAG AT AG CGACTCAGAT TCAGACAGCG ATTCAGATTC 3 66 0 

AGACAGCGAT TCAGATTCAG ATAGCGATTC AGATTCCGAC AGTGACTCAG ATTCCGACAG 3 720 

TGACTCGGAT TCAGATAGCG ATTCAGATTC CGACAGTGAC TCAGATTCCG ACAGTGACTC 3780 

AGACTCAGAC AGTGATTCGG ATTCAGCGAG TGATTCGGAT TCAGATAGTG ATTCCGACTC 3 84 0 

CGACAGTGAC TCGGATTCAG AT AG CGACT C AGACTCGGAT AGCGACTCGG ATTCAGATAG 3 900 

CGATTCGGAC TCAGATAGCG ATTCAGAATC AGACAGCGAT TCAGATTCAG ACAGCGACTC 3 96 0 

AGACAGTGAC TCAGATTCAG ATAGTGACTC GGATTCAGCG AGTGATTCAG ACTCAGGTAG 4020 

TGACTCCGAT TCATCAAGTG ATTCCGACTC AGAAAGTGAT TCAAATAGCG ATTCCGAGTC 4 080 

AGTTTCTAAC AATAATGTAG TTCCGCCTAA TTCACCTAAA AATGGTACTA ATGCTT CTAA 414 0 

TAAAAATGAG G CTAAAG ATA GTAAAGAACC ATTACCAGAT ACAGGTTCTG AAGATGAAGC 4200 

AAATACGT CA CTAATTTGGG GATTATTAGC ATCAATAGGT TCATTACTAC TTTTCAGAAG 4260 

AAAAAAAGAA AATAAAGATA AGAAATAAGT AATAATGATA TTAAATTAAT CATATGATTC 4 320 

ATGAAgnAAc rCCTTAAAAG GTGGCTTTTT TACTTGGATT TTCCAAATAT ATTGTTTGAA 4 380 

TATAATTAAT AATTAATTCA TCAACAGTTA ATTATTTTAA AAAGGTAGAT GTTATATAAT 444 0 

TTGGCTTGGC GAAAAAATAG GGTGTAAGGT AGGTTGTTAA TTAGGGAAAA TTAAGGAGAA 4 500 

AATACAGTTG AAAAATAAAT TGCTAGTTTT ATCATTGGGA GCATTATGTG TATCACAAAT 4 560 

TTGGGAAAGT AATCGTGCGA GTGCAGTGGT TTCTGGGGAG AAGAATCCAT ATGTATCTAG 4 620 

TCGTTGAAAC TGACTAATAA TAAAAATAAA TCTAGAACAG TAGAAGAGTA TAAGAAAAGA 4 680 

TTGGATGATT TCAATATGGT CCnTTCCCA 4 709 
(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588: 

CTTTTTTAAt TAwCgGaAtA TTGtCaTgaT tAcAcTTCGt TAGGGTTTAc gTCGtAATTT 60 

CATTTa a TAA gCgCTTCAcC ATTAAAtGTG g TAmCCTTT A ATTCGCCAGT AGAAACATCA 120 

CAGTAACTAA GCGCAATTTC AGGTTGATTC ATAACAAAAC TTAAAATATA GTTATTTTGT 180 
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CGTCTAACCA 


TACCTTTCGT 


TTGTTTCGGA 


TCTTCCATCT 


GTTCACAAAT 


AGCTACTTTA 


300 


TATCCATTAT 


TAACAAGTGT 


ATCTATATAA 


CTATCTG CAG 


AATGATACGG 


AACACCACAC 


360 


ATCGGAATTG 


GATTTTCTTT 


TTTAGCATCT 


CTTTTAGTTA 


AAGTAATTTC 


AAGTAyACGT 


420 


GATGCCTCCT 


TGGCATCTTC 


ATAAAACATT 


TCATAGAAAT 


CACCTAGTCT 


AAAAAATAAT 


480 


AAGCAATCTT 


GGTATTCTGA 


TTTTATTTTT 


AAATATTGCT 


GCATCATTGG 


TGTAACATTA 


540 


GACATATTAT 


TTCTTCACAA 


CCCTTGTCTC 


TTTTTAAAAT 


TTGTCTTTAC 


AATATATTCG 


600 


TTTGTAAGyT 


TTTTAATTAT 


TAATTATTTA 


ACTTATACAT 


TTTAACATAC 


TTACTTTTAC 


660 


AAACCTATTC 


ATACCATATA 


ATCACGAAGC 


ATCTTAAATG 


TATAAGAAAA 


CGCCTCAAAC 


720 


CTAATAAAAT 


GTGTCAATAG 


CATGTTTAGA 


ATTAAATTAA 


AATTCTAACA 


TTCAAGACAT 


780 


TTAATTAAGT 


AAGGGCGTTC 


AATATTAAAA 


TGAACAATGA 


CTCTGTTTGA 


AATCATATAT 


840 


CATAAAATTA 


TTTTATAAAC 


CTTTGAAGAA 


TACCACGTTT 


TTTTAGAGTA 


ATT AATAA GA 


900 


AATAA CTTAT 


AATAGATCCG 


ATAGCACTTG 


aGACTATGaA 


CGTAATCATT 


AACGGTTTAA 


960 


TGAAGAAGTC 


TTGAAGCCCA 


AGGaAATATG 


CTAATGG t AT 


a CaAATTAAA 


cTTCCgATGA 


1020 


CaCCAGTTCC 


aAGTACTTCa 


CCGACCGCGG 


CCaTAAATAT 


ATGTTTACGA 


TATnygTAAA 


1080 


ACATACTAGC 


CAATAAAACT 


CCAATCATAC 


TACCCGGAAA 


TGCAAAAGst 


GTACCAGTAC 


1140 


CAAAAAGAAC 


TCTTAAAATT 


GATGATATAA 


GCGCTTGAGC 


TAATCCATAC 


CAAGGACCTA 


1200 


CTATGACCGC 


ACTTAATACA 


TTTACAAAAT 


GCTGTACTGG 


TGCTGCCTTA 


ACTGGTCCTA 


1260 


GAGGAATGAT 


GATAATACTG 


CTTAATACAA 


CATTTATTGC 


AATTAAAAGT 


GCAGTTATAG 


1320 


CCAGTTTTCT 


TGATTTCATA 


TGATT GTTCT 


CCTTTT'TGTT 


TGTAATTAAT 


CACTATGCTT 


1380 


GGCTTTATTA 


TGGTCATTTA 


AACGTGTTTC 


CATTGTTGAT 


ACAAACATTT 


TCAATAATTG 


1440 


ATTCGCTTCA 


TATTGTGAAG 


TTTGAAACTG 


TTCAACTATG 


GGCAATGTAT 


TTATTTCTGC 


1500 


TTCTAt ACT C 


TGAATGGtAT 


GTTCCGACTG 


nTCCAGCGCA 


TTTGTTTCCC 


GnAA 


1554 



(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 
4S (A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 9: 
TTTTGACGAA AAAAGTTGAT TTACAAATAT ATAAACGTTG TGATTTCAAT GTTTGTATAG 60 

55 



1522 



EP0 786 519 A2 

TATATACATG ACAGCAACTT GGGAAAAAAA GGAAGGTAAC GAAGGTTTAT TAACTGTTAC 180 

TGTTCCTGCA GAAAAAGTAA ACAAAGCrCT GAGAGATCCC CTCATAATTT CCCCAAAGCG 24 0 

5 TAACCATGTG TGAATAAATT TTGAGCTAGT AGGGTTGCAG CCACGAGTAA GTCTTCCCTT 3 00 

GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA AGCGAACTGT TGAGAGTACG 36 0 

TTT CGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC AACCTTGTTT CTGAGCATGA 4 20 

W ACGsCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA TTAG CAGCAT GmTATCAAAA 480 

CGCTCTGAGC TGCTCGTTCG G CT ATGGCGT AGGCCTAGTC CGTAGgCAGG ACTTTTCAAG 540 

TCTCGGAAGG yTTCTTCAAT CTGCATTCGC TTCGAATAGA TATTAACAAG TTGTTTGGGT 600 

15 

GTTcGAATTk CAACArGTaA GTtAGtTGCT AGAnCCCA 6 38 



(2) INFORMATION FOR SEQ ID NO: 590: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 



AAAATATTCC CGTACATTTT 


GATGTCTGTA 


GGGGCTTTTT 


TGACTTTAGG 


ATTTGTCATT 


60 


TTTTCAATTC ATAAAGGGAG 


ACGAACGAAA AATGAATCAG 


CACGTAAAAG 


TAACATTTGA 


120 


TTTTACTAAT TATAATTACG 


GCACATATGA 


CTTAGCAGTA 


CCAGCATATT 


TACCGATAAA 


180 


AAACTTAATA GCTTTAGTAT 


TGGATAGTTT 


GGACATTTCA 


ATATTTGATG 


TCAATACACA 


240 


AATTAAAGTG ATGACGAAAG 


GTCAATTACT 


TGTTGAAAAT 


GATCGACTCA 


TTGATTATCA 


300 


AATCG CTG AT GGAGATATTT 


TGAAGTTACT 


ATAGGAGGGA 


AAATAGATGG 


TTAAAAATCA 


360 


TAACCCTAAA AATGAAATGC 


aAGATATGTT 


AACGCCTTTA 


GATGCTGAAG 


AAGCAGCTAA 


420 


AACAAAATTA CGCTTAGATA 


TGAGAGAGAT 


TCCTAAGTCT 


TCAATTAAAC 


CAGAACATTT 


480 


TCATTTAATG TACTTATTAG 


AACAACATTC 


TCCATATTTT 


AT AG ATG CTG 


AATTAACTGA 


540 


ACTACGTGAC aGTTTCCaAA 


TACATtATGA 


CATTAATGAC 


AATCATACAC 


CTTTTGATAA 


600 


TATTAAATCA TTTACTAAAA 


ATGAAAAATT 


ACGTTACTTA 


CTCAATATCA 


AAAATTTAGA 


660 


AGAAgTAAAT CGTACACGCT 


ACACATTTGT 


GTTGGCACCA 


GATGAATTAT 


TTTTCACAAG 


720 


AGATGGATTA CCCATTGCTA 


AAACAAGAGG 


GTTACAAAAT 


GTTGTTGATC 


CATTACCTGT 


780 


GTCAGAAGCT GAATTTTTAA 


CAAGATATAA 


AGCGCTGGTT 


ATCTGTGCAT 


TCAATGAGAA 


840 



55 
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AACTAAAGTT ATTGAAGCGG CAACGTTAGA TTTACTAACG GCATTTTTAG ATGAACAGTA 96 0 

TCAGAAACAA GAACAAGATT ATAGTCAAAA TTATGCATAT GTACGCAAAG TAGGACATAC 1020 

CGTTTTCAAA TGGGTTGCTA TCGGTATGAC AACGTTAAGT GTTTTATTAA TTGCATTCTT 1080 

AGCCTTTTTA TATTTTTCAG TAATGAAGCA TAATGAGCGC ATTGAAAAAG G ATACCAAG C 114 0 

ATTTG TAAAG GATGtTATAC GCAAGTACTA AATACGTATG ATGATTTAGA TGGTaAAAAt 1200 

TgaTAAAGAG GCACTTTACA TTTATGCCAA AAGTTATATC CA 12 42 



(2) INFORMATION FOR SEQ ID NO: 591: 

J5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 



TTCCAGATAG AGCCTTAGTT GCTGCCGCTG AATTGTCTGA TCGTTACATC ACTGATCGTT 60 

25 TTTTTACCAG ATAAAGCGAT TGATTTAGTT GACCAAGCAT GTG CAACAAT TCGTACGGAA 120 

ATGGG AT CAA ATCCAACTGA ATTGGATCAA GTTAATAGAC GTGTCATGCA ATTAGAAATT 1B0 

GAAGAAAGCG CACTTAAAAA TGAATCTGAC AATGCGAGCA AACAGAGATT ACAAGAACTA 24 0 

CAAGAAGAGC TTGCCAATGA AAAAGAGAAA CAAGCAGCAC TTCAATCTCG TGTAGAATCA 3 00 

GAAAAAGAAA AAATAGCAAA TTTACAAGAA AAACGTGCGC AACTAGATGA AAGTAGACAA 3 60 

35 GCGTTGGAAG ATG CACAAAC AAATAACAAT TTAGAAAAAG CTGCTGAACT ACAATATGGA 4 20 

ACAATTCCTC AATTGGAAAA AGAACTTAGA GAATTAGAGG ATAATTTCCA AGATGAGCAA 4 80 

GGTGAAGATA CAGATCGAAT GATTCGTGAA GTTGTAACAG ACGAAGAAAT TGGCGATATT 54 0 

40 GTCAGCCAAT GGACAGGCAT ACCAGTTTCA AAATTAGTTG AAACAGAACG TGAAAAATTA 600 

CTTCACTTAA GTGACATCTT G CATAAACGT GTTGTAGGTC AAGATAAAGC GGTTGACCTG 660 

GTTTCAGATG CAGTAGTTAG AG CAAGAGCA GGTATTAAAG TnCAAACAGA CCTATTGG TA 720 

45 GTTTCTnATT CCTAGGTCCn ACTG 744 



(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



55 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

TTG TTATAGT GTCTGAATCT GTGTAACTCT TGCCCATGTG TTCTGAGTAA AGCACCCACT 6 0 

GTTTATTTAC TTTTCGTTGT AGTCTAGCTT CGTGTAGTAG TTTGTTTAAC TTTTGTGCTG 12 0 

ATATACCGTA GTCTGCCGCG ATTTGAGTTG TGGCTAATGT GCCAGTTGAT TTTAAGATTT 180 

CATCAACATA ATCTGCTTTG GGTTTTAGCT CTCCAATTTC TTGTTGTAAA AGTAAGTTTT 24 0 

GCTCTTTTTC TTTCTTATAC TCAGTCAACA CTGTAATGAT GTAGTCTGGA TCTTTTAATG 300 

TTTGTTCAAT TACATTGTCT GTTGCGTATA TACCGTGTTT GCGAATAGCT C-GTaGGACAT 360 

CTGATGTTAC CCAGCGTTTG AATTTTCTAG CGGTTTCTCT AATTTTTTCG TTTTTGCTTT 4 20 

GTTTAGAAGC ATCGAAGATT AGACTGTATA ATCCTGATTC GTTGATAATG ATCATATTTC 480 

TGTTTTGACC TGATGCACTA AATTGGTGCG TCAGTTTG TC CTCGCTATCA ACATGATTTC 54 0 

TAATGGCATT GTCTGATCTT GCATATCCTA AAATCTCAG C AATATCTTTT CCTACAAAAT 600 

AAGGTTCGTT TTCAATTTCT ACTGTTCTTA CTGGTAGCTC TTTAAAATTA AATGTTTGTA 66 0 

ATGCTTGCAT TTGAGTATCC TCCTTTTTCC TCAACACCCA CATT CAGCAG ACGGTTATCG 720 

CAATGACTAT CGAATGTATT TAAACGCGGC TCATATCATC GCCAGcTCTC GCTCACATCT 7 80 

GCT CAATGTG GATGTTGATA AGCGTGGTTA TATTAAGAAG TGAATGTTAC TGATTCACTT 84 0 

TCCGCCACTC TGTTAAATCA GTAACTTTGT TATCGCTTTC AACACCGTTA AGCTTGTCTA 90 0 

ACGCTTTCAC TACTTTTTGG AACTCTTTGA TAGCACTtCG TAGCTTTTTA GTAATTTCAT 96 0 

CTTCTACCAT TTCCAAACCA GCAAATGCGT CTTCGTTATT CATGCTTAGA TGTTTGTTGA 102 0 

AAAGATCTCG AGTGTATCTT ATTTCTTTAA GTGATTTATC ATAAGCTTCA ATTTGTCCTG 1080 

AAAGGTTATG ATATTTTAGT TGTAGTTTTA CTAATTTTAA TGATTGGTCT TGCATTTGTT 114 0 

ATGTCTCCTT TAAGATGTTT GTTTGCGTTT CGTGTACTTT GTGGGTAAAA AAATATCTCC 12 0 0 

AATATTTTCG TCAAAAAAAT CAGCGATAAT AAACATCTCA TCATTCTTAA ATTGATGCTT 12 6 0 

TCCTAATTCT TTTAAACGAT AACCTTCAGT TGATATATTC AAGAGGTTTG CTAAATCTTC 132 0 

TTGAGTACAC TTTCTTTCTT TTCTCAACTT TATTAAATTC CATTGCATGT TGTCACCTCC 13 8 0 

CGCTTTACAA AACCTACTAT ACACGATACG GTACTTGnGT CAACATAAAA GTTTGCTTTT 14 4 0 

CGTGTATTT 144 9 
(2) INFORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

5 ACCAAAAACA CTTTCTAAAG AAATTGTTTC AGAACCATTT GCTAAAAACC AATTACGTGA 60 

AAAAGCTCGT CTAACTAACA TTAAAGGTTT AGAGATTCCA AGAGTTTCAT ACACTTTAGA 120 

CGATGATGAT TTCATTACAG ACGTAGAAAC AGCAAAAGAA TTAAAAGCAA AAGGTGATAC 180 

'° AGTCAAGTTC ACTACTAATA AATTCAAAGT ATTTGCTGCA ATTTCAGATA CTGTAATTCA 24 0 

TGGATCAGAT GTAGATTTAG TAAACTGGGT TGAAAACGCA CTACAATCAG GATTAGCAGC 3 00 

TAAAGAGCGT AAAGATGCCT TAGCAGTAAG TCCTAAATCT GGATTAGAAC ACATGTCATT 360 

75 

TTATAATGGA TCTGTTAAAG AAGTTGArGG AGCAGACATG TATGATGCTA TTATTAACGC 420 

TTTAGCAGAT TTACATGAAG ATTATCGTGA TAACGCAACA ATTTATATGC GATATGCAGA 4 80 

2Q TTATGTCAAA ATTATTAGTG TTCTTTCAAA TGGAACAACA AATTTCTTTG ACACACCAGC 54 0 

AGAAAAAGTA TTTGGCAAAC CAGTAGTATT TACAGATGCA GCAGTTAAAC CTATTGTGGG 6 00 

AGATTTCAAT TATTTTGGAA TTAACTATGA TGGAACAACT TATGACACTG ATAAAGATGT 660 

25 TAAAAAAGGC GAATATTTGT TTGTATTAAC AGCATGGTAT GATCAGCAAC GTACATTAGA 720 

CAGTGCATT C AGAATTGCAA AAGCAAAAGA AAATACAGGt CCATTACCCA GCTAAGCCCC 780 

AAAAGGTTAA TGTAACAGCT AAGGCTAAAT CAGCTGTAAT ATCAGCCGAA TAGGGGTGAT 840 

30 GAAATGAGTT TAGAAGAAAk TAAATTGTGG TTGAGAATTG ACTATAATTT CGAAAATGAT 900 

TTAATTGAAG GTCTCATTCA ATCGGCTAAG TCTGAATTAC TATTAAGTGG GGTnCCAGAT 960 

TATGACAAAG ATGACTTGGA ATACCCGCTT TTTTGTACAG CGATTAAATA TATCATTGCA 1020 

35 

AGAGATTATG AAAGTCGTGG ATACT CAAAT GACCAATCTA GAAGCAAGGT GTTTAATGAA 1080 

AAAGGATTG C AAAAAATGAn TTTGAAATTA AAAAAGTGGT AGGTGATTTT TAAATGGAAT 1140 

4Q TTAATGAATT TAAAGATCGC GCGTATnTTT 1170 

(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1393 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

b ° (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 

TGAAGGGTGT TTTGTTTATG GCCAATTGCT GTGTTATTAA AnCGAATGTT TCGAATGGAA 60 

55 
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ATCATTCCGG AAGAGGACAT GTCTTCAGCT TCTGATTTGA AAGCACAGTT TTATCGCGGT 18 0 

ACTTTGCAGC GCTTTTACCA At CnTTGCAC GCAGAAAAGC TTACACCTTA TGTTATGTCT 24 0 

TATGACGATA TCATTTCATT TTGTAAAGAA AACAATATCT CTGAAGTAGT GACTGCGGGT 300 

GATATTATGA GTTATCATCT TGAAGAATAT GATATTTTAC ATCAACGTTC TTTATTCAAT 36 0 

GAAG CACGCA TTGCCGTTAC TTTGATACGT GGGAATCATT ACTTTAAAGC GAGTAAAACA 42 0 

ATGAATCAAC AAGGGGAG CC ATACAATGTT TTTACTAGTT TCTATAAAAA ATGGCGACCT 48 0 

TACTTGAGG C ATAGAGACGT ATATCACTAT GATTTAAAAT CATTCGAAAA CTTTGTCATT 54 0 

GCATCACCTG ATGATTTAGT GTTTGATGAC ATAGCATTTG GATCCTCACA aATAATTGAA 600 

CAGatAAATG GCAACATTTT TTAGATCAAG ATATACAGAA TTACGAAAGC GGAAGAGACT 660 

ATTTAC CTG A AGTATTAACA AGTCAGCTAA GTGTTGCTTT AGCATATGGA TTATTAGATA 720 

TTATTGAAA T TTTTAATGAT TTATTGGCGC GTTATGATGA AGATGAGGCA AACTATGAAG 78 0 

CATTTATACG TGAACTCATT TTTAGAGAAT TTTATTATGT GTTAATGACA CAGTATCCTG 840 

AAACCTCATA CCAAGCTTTC AAACCTAAAT ATCGACAGAT AAAATGGTCG CAAAATGAAG 900 

CGGATTTTAA TGCATGGTGC GAAGGGCAAA CAGGATTTCC AATCATTGAT GCAGCAATAA 960 

TGGAATTGAC ACAAACTGGT TTTATGCATA ATCGAATGAG AATGGTTGTG TCGCAATTTT 1020 

TAACCAAAGA TTTATTTATA GATTGGACAT GGGGAGAAAA ATTCTTTAGA AAGCACCTTA 1080 

TTGACTATGA TGCAGCATCA AATATTCATG GATGGCAATG GTCTGCTTCT ACAGGTACGG 1140 

ATGCAGTGCC GTATTTTAGA ATGTTTAATC CAATAAGACA GAGTGAACGC TTTGATGCTA 12 00 

AAGCTTTGTA TATCAAAACA TATCTTCCGA TTTTTAATCA AATTGATGCA AAATATTTGC 1260 

ATGATACACA ACGCAATGAG TCCAACCTTT TTGAACAGGG GATTGAATTA GGTAGTCATT 1320 

ATCCAAGACA AATGGTAGAT CAT CAAG AAA AACGTACACA AGTTTTAGCT ACATTTAAAG 13 80 

CGCTAGACTA ATT 13 93 
(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 

TTAACTTGAA AGATTTCCCC GACATCTATA ACTTGTATAA TGTGTATATG TCGTTTTTAG 60 
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AAAAATTAAA AAAAGAATAT CTTGATTGGT ATAATCAGAC CTTAGAGTTT TCTAATTTAT 180 

CAAACAATGT AGTAAGwATA GATACTCCTT TTAAAGATAA TTCTTTAGAT AATTTAATAA 24 0 

TTTACGCTTT ATACGATCAG TCCAGAGACA TGATTACACT GACAGATGAC GGCTATACTA 3 00 

TATtTGATTT AGAAAATAAT GGTATTTCTT TAAATAAATC AAAAAAACGT AAAAAGATTT 36 0 

TTGAAGAGCA CCTTTCAGCT TACGGTATTA AATATAACGA TAAAACTCAC GAAATTTTTG 420 

TTCAAACTAA CTTTAAAAAT TTTAATAAAT CGAAACATAA TTTATTACAG TGCCTTATAT 4 80 

TTGTTAATGA TATGTACTTA CTTTcTAATC CTAAGTCACA GAACATATTT ACAGAAGATG 54 0 

TTGCAAACAA ATTGGATGAA CATAACATTT ATTACGGAAG AGATTTACCT ATTATAGGAA 600 

GCAGTGGTGT TGTTCATAAT TTCGACTTTT TTATTAGCGC TAAGAAAAAT CAAAAAGAAA 660 

AATTTATCAA TGCTATTTCT AACCCTAATA ATTCTATGAT TATTAAGTCG AAAATAACGG 72 0 

ATGCTATGCA AGCAAAAAAA ATAAAAAGAC ACAGGCAAAA TGAGTTTATT TTTATTTTAA 780 

ATGACTCAAA AAAAGAAATA AATGAACATA ATAAAAATCT TCTTCATGAA AACTATATTA 84 0 

GTACAATAGA TTATAGCGAA TTAGATGAAA AGATAGGTTT ATTGATTTAA TATATATAGA 900 

CGTGATAATA TCAATGTTTA TATTAAATGA AACGAATTTG AAAATTTCGA AACAGCTTAA 960 

GAT AG CAAAT TGAATAGCCT TATTGATAAT GCAGAATCGT CTACACTTAG TTGAACAAAT X020 

TCTATGAGAA TAGATATTGT TAAACTATTT GGGTAGGCGA TTTTATTTTG ACAAGAGTCA 108 0 

GAAGATTATT TAGTTCATAG TTTAAGTTAC AAATGGCTAA ACTTTATGAA AATGGTATGA 1140 

CTAGAATCGA AATTATACTC GAATACGATT TAACACTCTC AATCTTCTCG AACTGAATAA 1200 

ATACTGGGTC ATTCAATCAT CAAGACAACT TAATAAGTGA TGATAAATGT TCATACCATT 1260 

TTGCAAATAT AACCAGATAC TCTCAACAAA ATGTAGAGTT CAAAATTAAA AATGTCCCTA 13 2 0 

ATTGGCACAT TGCTGTATGA TAATAAATTC AACTTAGATT TCAACAATAT AACTAATGAT 13 8 0 

GCTTATATTT ACCGCGATAT CAAACCAAAC AGCCCATCCA CAAACAAATT AG AAT AT AT C 144 0 

mTGTTAATAA TAGTAAATGA TATATCTTCC AAAGACCGCA AACT i 4 84 
(2) INFORMATION FOR SEQ ID NO: 5 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596: 
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CTTTATCaTT AAATGCCaTT TTGaATCCTT CTTGAATTTG TTTGATTTCC TTTTCAGTAT 120 

TATTAACCTT GTTCAAGACA GTGCCATTAA TTTTCTTCTT GAGTGGACCA ACCTCATCTT 180 

5 TATTAGTTGA TTCATGAATC GTTAATCCAA TGTGTGGCTG TATTCTATAA CCATCATTCG 24 0 

CTATAGTTGA AACATATTGT GaTAATTGTA ATGGTGTATA GGTATCATAT TGACCAATTG 3 00 

ATAAATCTAG ATAATTACCT GGATTATTTG TTAATGGTTC GATTTGACCT CTTGTTTCAT 360 

to 

TTGGTAAATC TATCCCTGTT TTCACACCTA AGCCTACTTG ATTTAATCCT CTTCTTAGCT 420 

TTTGGGCAGG TGAACTTATG TCTGAAGGTA AAGCCATACC AGAATAATAA GGGTCTCCCG 4 80 

j5 CTAATTTTAA TGCTGTTTTA AACATATATA CGTTTGATGA ATGCATCAAA GCTTGCTTAT 54 0 

CATTAATAGT TACATGCCCG TTTTTATTGA AGTATGATCG TTTTGTCAAA CCACCTTGGA 600 

AATGTAATGG TTCATCGACC ATTGTTTCTC CAACTTTGAT AGCTTTATTC TGATAACCGG 660 

20 CTAATAATGT TCCACCTTTT ACAGAAGATC CAACCGCAAA TTGAGAAGTA AACGTACCAA 720 

TGT CATAATC AGTCATTTTA CCACTCTTAT TAATCTGCTT TCCGGCAAGC GCAAGAATGT 780 

CTCCATTTTT AGGATTTTGT ACAACCATCA TTGcATTATC CATATCTTTG GCACCTTGAC 840 

25 TGCGAsTTCT TAATTTGTTT ATCTAATAAT GCTTCTACTT CTTTTTGAAG ATCTATATCG 900 

ATCG TTAATT TCAAATCTTG ACCGCGAGCG CCAGGrTTTA ACACTTCTGA AGATGTAACT 960 

TTACCAGATT TGTCCGTTGT GTATTTCATT TCTTTCTTCT TACCACGcAA TACATCTTCA 1020 

30 

TATTGATATT CTAGGTAAGA TTTTCCAACA CGATCATTGC GTGAATATCC TTTGGATAAG 108 0 

TAATGTTCTG TCAATTCTTT TGGaATACCT TCAGCAGGTG TCGATACATC TCCGAATATA 114 0 

CCTCTTAAAG TATCGCCATA TGGATATTTT CTATCCCAAT CCATAGACGT GTTAACACCT 12 00 

35 

GGTAATTTGG AAAGTTGCTG AGAAACTGCT GCATACTCTT TTTCACTGAC ATCTTCATTT 1260 

TTTATCATTT GTGGATCTAA AACTGTTCCT GGCATT CATC TCTCGAAAAA TAGCTAAAAC 13 2 0 

4Q TTGGTAAATC TTTAGAAGAC AATTCATCTA ATTGTGGATT TTCCGGATTT CGGATAACAG 13 8 0 

TTGGTnTATC CATAATGGAT CCTGGTTTAA TACCTTCCAn CnGGCGAACA TAGCC 143 5 
(2) INFORMATION FOR SEQ ID NO: 597: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

so 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597: 

55 
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AGAAAAAGAT TTAGAAAAGA TTACCATTCG CGACATAACA ACACGCGCTG ATATCAATAG 120 

AGGGACATTT TACTTACATT ACGAGGATAA ATATATGTTA CTCGCAGATA TGGAAGATGA 180 

GTATATTTCA GAACTAACGA CATATACTCA ATTTGATTTG TTACGCGGTT CTTCArTTGA 24 0 

AGACATTGCG AATACTTTTG TAAATAATAT ACTCAAAAAT ATTTTCCAAC ATATTCATGA 3 00 

TAATTTAGAG TTTTATCATA CTATTTTACA ATTGGAACGC ACGAGTCAAT TAGAATTGAA 360 

AATCAACGAA CATATTAAAA ATAATATG CA ACGTTACATT AGTATCAATC ATTCTATCGG 420 

AGGCGTTCCA GAGaTGTATT TTTACAGTTA CGTTTCCGGA GCAACAATTT CaATTAnnTA 4 80 

AATACTGGGG TAATGGACAA ACAGCC CATT TTCAGT 516 
(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1955 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 
(D> TOPOLOGY: linear 

■ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 

AGTACTTTAC CATGAGGTAA GACTTTATTC CCTGCTTTTA TAGATTGTGC TTTTATTTGT 60 

GTGCCAGAAC CAATAGtAct GGT cTAATGA TTTTGCACTC ACATTATCAA TAGCTTTTTG 12 0 

TTTGTCTTGA CCTTCAACAT CTGGCACTTT ACTATACTCT GCATTAGATG TGTCATCTTT 180 

TGATTTACCT ACATTTAAAT ATTTCAAAGT ATTTTCCATT ATTGGTTTAA ACGCTTTACT 24 0 

AACACCTAAT TCATAAGCTT CTTGGTCATT TTTTTGTGCC AAGCTCATAC CAG CGTATAC 3 00 

AATAACTTTA GGATTTTTCT TCGGCGCGTC ACCCATAAAA CTTACAAAAT ATGGGTTTGG 360 

ACCTTTAACG TATCCACCAC CATTAGGTGC AG CGACTTGT GCTGTACCAG TCTTACCTTC 420 

GACCTCATAA CCATCAATAC GATAGTTTGC AGCGTGACTC TTCTTACTAT TCACAACTAA 4 80 

ATCCAATTGC TTTTCAACTT TTTCAGCAGT ATCTTTTGTT ATTGGTTTGC CTGCGATTTG 54 0 

TTTTTGCCCT TTATAAAATT GTCTTTTACT AACAGGATTT TCAACGCTAT TCACAAACCA 6 00 

TGGTTTTAAC ATATTACCAT CATTAAAGAA CGCTGATTGC GCTTGTAACA TTTGAACAGG 660 

TGTTACTGTT GTCGATTGAC CAAATGATGA CGTTTTTTGt TG CAACTCAT TACTCCATCC 720 

AATTTGaCCA GGTGCTTCTC CATCAAACAT ACCTTTAGTT GaTTTTCCaA ATCCaAATCG 780 

TTCATACCAA GATTTCATTT TGTCTGCACC AACTAAATCT TGTAAATGCA TCATCAATGT 840 

ATTAGATGAA TAAGTAAATC CGAGTGACAT TGGGATTTCA CCCCAACCGA CTCTATTCCA 900 
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AAAAGCACCT TCTTGAATAG CAG CTGCTAA CCCATATGAT TTAAATGTTG ATCCAGGCTC 1020 

GTATGTGTTT TGATAAAGGT CATTTGCCCA CTTTTTACCA AAGTCTTTAC CAGTTTCAGG 1080 

ATTAAATGTT GGTCGCTGAC TGTATGcTAA AATTTCTCCA GTTTTGGCAT CCATGACAAC 114 0 

CGCAAATAAA TCTTTCGGCT GGTATCTTTC AACCATG C CA TCTAAAGCTT CTTCAACAAA 1200 

TACTTGAATA TTTGAATCGA TTGTTAAATG GACATCATCA C CACGTTT AG GCTGCTTCTC 1260 

TTTTTTAGTA TTTGGTGCGA TATATCCCCA AATATCATGA ATATATCTCA ATGATCCTTT 132 0 

AGATCCACTT AAATAACTAT CAAAAATCTT TTCAACTCCA AGTGCACCTT TAAGTTCACC 1380 

AGTATCCGGA TTTTTCTGAG CTCTACCAAT TAAGTGTGAT GCAAAATTGC CATTTGGATA 1440 

AAAGCGTTCT GTTTCAGGCA ATAAAGAAAT ACCAGGCAAA TTCATTTTCT CTATTTTCAA 1500 

TTTGTCCTGA TACGTTAAAT TTGTTCCTTT GCGTCCAAAT TCAATTTGGA AAG CTTTCTT 1560 

TTG ACT AAG T CTCTTTTCAA TTTCCTCTGG CTT CATATTA ATGACTGTAG ATAATTTCTT 1620 

TGCAGTCTCT TTTTTATCAA CTACATGCCT AGGTTTTTTA GAATTGGCAC TCGCCTTTTT 1680 

ATCTATTACT GCAACAAGTT TATATCTTTC TACATCTTCT GCTAGCACTT TACCATkACG 174 0 

ATCATATATC TTTCCTCGTT CTGGTTGTTG TGCATTCTTA ACTAAATACT TTTCATTTGC 1800 

CTTCATGACT AAATCTTGAC CATTAGAATG TCCAGTAATC ATGATATATG AAATT CTTAA 186 0 

AACCAATATA AAAAAGAGCA GTCCGAATAA A C CAACAAGT AGGACTGCCC CTATTTTATT 192 0 

TTTTTTAATT TTAATTTTTT TGGTCGCCAT TACTA 1955 
(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 

TAG CAAGAGT AGGTGATAAC AGTTCTTTTA ACGTGTTTGC TGCTTGCTCA TTTCTTGGCT 60 

CATTGTAGAC TAAAATATAA TAAGCGTTTT CAAATAAGTG CTTTTTAGCA TTTAGTACAC 120 

CAGATTTATG ACTACCAGCC ATTGGATGAC CACTGACTAA ATGAATATTA TGCTTTAATA 180 

AATTGCATTC GTGTTGCTGT ATCATTGCTT TAGTACTACC AGTATCAGAA ACAATAACAC 240 

CAGGTTTAGT TGGCATATCT ATAAGCTCGC TAAGATATTT ATTTGTGATA GCAACAGGTG 300 

TTGCATAAAT AATTACATCG GCTTTTTTAA TAGCTTCACT ATAATTTAAA CATTTTTCAT 360 
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TATTAG TATT AGGGTTATGG TATTTTATAT TGCTAGCAAG ACTTCCACCA ATTAATC CAA 4 80 

GCCCAACAAA TAAAACTGTT GTCATATAAA TCACCTTATT TCGAAATTTT CAGAATAATA 540 

ACATTGTAAA TGAGCTGTTG ACACAGTGCA ATAGTAAATA AAAATCGATA ATAGCATTAA 6 00 

TAGATAAACG GAGATAAATC ATCTACAATA AAGAGTATAG TAACACAATG G CAACGG AGG G60 

GGTAAATCAA TGGAACCAAT ATTAGAAATG ATtAAAACAT TAAC 7 04 



w 

(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1158 base pairs 
)5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600: 

CCTCTAAAGG ATCACGAACC TTTTTCATCG TGACTAAGCC TATAATACCC TTAAAc CTAT 60 

TATTATTAAC TTTTACCTCT GTGTATTCCC TATCAATTAG GCGACGCCAG TGACGTTTAT 12 0 

25 CTATATACTT TACTTTCACA GTCACCAACT CCTTGTCATT ATTATATAAA ACTACATAAA 180 

TGATGTCATG TCATGATACA GTTTTATAGT TTTTGTTAAT CATATGAAAA TATTTATTAT 24 0 

TATTTTACTA CAACTCGCTT CAATTTACTT AAAATAGACA ATATTAATTA GATAGTACAC 300 

30 

ACATTTCTTC ATAAAAGTGA TTTTTCAAAA ATATAAATAA CACACTCTTA TCGTTTTCAA 3 60 

AATCATTTAA TGCTATTTTC ATTAAAAACA G CTGAAG CAT CAAATCTATT CTGATTCAAT 420 

CAAGAATACA TATAAAAATG AAGTGACTCA AAGGTTTATT AACAACCTTC AAACCACTTC 4 80 

35 

ATTGATCACT TTTATTTTAA AGCATATTTT TCGATTACTG ATTTAAGATG CGGATATTGC 54 0 

GTAATTAATT CAGATTGCTT AAACAATTCA AACTGCTTAA ACTCAAATGC CGGTTGACAC 600 

40 ATACAACCTA CTAAACTAAA AGTATTTGAT ATTTCGATTG AAGAAGCAAA AATTGTTCCT 660 

TTAGGCACTA CATATTGCAA TACATCTCCA TTTTGGATAT CAGTACCCAA TGTTGCAGTC 720 

GTATATTCCC CATCCGGATT TATCATATGA ATTGTTAGAG AATCGCCAGC ATGATAGTAC 7 80 

45 CATACTTCAT CAGCATCAAT TCGATGAAAA TGCGAAATAT TGTCATCTGT AAGTAAAAAA 840 

TAAATACTAC TAAACGGCGC TCTGCGTCCA TCTTTCAATA CTTCTCGAAT TGTCTCTCTA 900 

TAGAAACCAC CTTCAGGATG TGATTCAAGT TGCAATTcAT CaATCCATTG TTCTGCTGAT 96 0 

50 TTCATTATTT CAAATCCACA TTATGGAATA CGTTTTGTAC ATCTTCTAAA TCTTCTAATG 102 0 

CATC G ATT AA TTTTTCAAAT GTTACTTGGT CCGCTTCAGA AAGTTCAATA TCTGTTTGAG 10 8 0 
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GAACGACTGC AAATTGAT 11 5 9 

(2) INFORMATION FOR SEQ ID NO: 601: 

s (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601: 



TGATACTTAT 


TTTTCAATAT 


TATTTGGAAT 


AAAnTCTTnT 


AATTGTGGTG 


TGAAATAAAC 


60 


ACCGAAATTC 


TTTTCAGATT 


CATCTTTTcC 


ACTACCTGCA 


TATAAAATAC CAATCAGTyC 


120 


ATGTTTAGAA 


TTCAATACAG 


GAGATCCTGA 


ATTACCTGGT 


TGTGCATACG 


CAT CAAATTC 




CATAAACGTT 


CCACTGATAT 


GGTTAATCGT 


TCCTGTCGAT 


TCAAACATTT 


TATATTTTGT 


240 


TTGTGCACCC 


TTTGGATAAC 


CAATAACAGA 


AATTCTATCT 


TTCACTTTTG 


CTCCGTCTGC 


300 


AAATTTTGTA 


TAACTAACGT 


TCTTATTAAA 


ATTCAAACCT 


TCTGTACTTG 


TTTCATGAAC 


360 


ATGAACTATC 


GCAAGGTCTT 


CTTTTCCGGG 


ATATTCTACA 


ATGTCTTTAA 


CGTCGTAGTT 


420 


TCCTCCGCCT 


TTACCTTTAC 


TCGAATGATG 


TGCTGATACT 


CTATTTTTAA 


AAATATCATT 


480 


ACTTTTAGCG 


ATATGTTTGT 


TAGTTACGAT 


TGTATTTTTA 


CCAACAACTA 


CACCAGTACC 


54 0 


ACCCACAAAT 


GCTACCACTG 


aATTGTATGG 


TTCCTTAGTT 


GCATCGGTAA 


TTTCTTTGAC 


600 


ATTCTTTTCT 


GCTTTGGCAA 


TTGAATGArG 


CTGATTAGAA 


ATATTTTCAG 


CAAAACCAAG 


660 


AGATGTTAAA 


ATAGTTAAAG 


CAG TTAAACC 


TTTAACCATT 


ACATTTTTAT 


TCATATTTTA 


720 


CCTCCTTCCA 


AATTTATTTT 


nATAGTTAAT 


GTATCTAGAT 


ACA CATTTAC 


TATATACATT 


780 


TTAAATAAAA 


ATATTnATTA 


AATATTAATT 


AACCACTTAT 


nAACGTT 




827 



(2) INFORMATION FOR SEQ ID NO: 602: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 602: 
50 nGGnACGTAA AGTATTTATG TTAAATGGCA nAAATATATG AATGATGAGT TACCTGGATT 60 

ACCAATGTTC CAAGGTAAAT CGATAACTAT TGTTAACGAT AAAGTACGAA ACTTAGACAT 120 

55 
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CGACGATTTT GTCGAGGTTT ACTTATATGG GGGGATGTTA TGAATAATGT ATTG TT AG AG 24 0 

GTTAAAGATT TAGAAACATC ATTAAAAATA AATAATGAAT GGTTAGCAAC TGTTGAAAAT 30 0 

5 

ATTTCTTTTG AATTATCTAA AGGAGAAGTT TTGGGTATAG TAGGGGAATC TGGTTGCGGT 36 0 

AAGTCCATAT TAAGTAAGTC AATTATTAAA TTATTACCAG AAAAGATATC TAAACTAAGT 42 0 

w AATGGAGAAG TTATATTTGA TGGTAAACGA ATCGATACGC TCAATGAGAA GCAATTGCTA 480 

GATATTCGAG GAAATGATAT TGCTATGATT TTTCAAGAAC CTATGACTGC TTTAAATCCT 54 0 

GTATTTACCA TAAAAAATCA ACTTGTGGAA TCTATAAAAT CACATAAAAA AATTTCTAAA 600 

15 AAAGAAGCAA ATAAATTAGC AAAAGATTTA CTAAAAAAAG TTGGAATTGC TAGACAAGAT 66 0 

GAAATATTAA ATAGCTATCC TCATCAATTA TCTGGTGGTA TGAGACAAAG AGTAATGATT 720 

GCAATGGCCA TTTCATGTTC TCcTAAATTA TTAATTGCTG aTGAACCTAC AACAG CATTG 780 

20 

GATGTCACGA TTCAAG 7 96 

(2) INFORMATION FOR SEQ ID NO: 6 03: 

2S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1618 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603: 
GGATGTGTTC ATCCCAAATT GaAAcAgAaA TTTATGATCA AAtGATGTGG ATTGGTTTTA 6 0 

35 

AaCCATTCAa AATTaCCAAT ATTAma CAAG AATCTGAAGA CATTAAATCA TTTtACAGTT 120 
GAAACTGAAG AATATGACTT TAGTGAATTT ACACCAGGCC AATACATCAC AGTTGATGTT 180 

40 TCTAGTGATA AACTTCCATA TAGAG CTAAA CGTCACTATT CTATCGTATC AGGTGAAAAA 24 0 

AACCATTTAA CTTTT GGCGT TAAACGTGAT GTCACAACAG AACATGAAGG CGAAGTTTCA 3 00 

ACAATTTTAC ACGATGAGAT TAAAGAAGGC GATATGATTA ATTTAsTGCG CCTGTAGGTG 360 

45 GATTCGTATT AGAGAATACG ACTGAACCAC AACTTTTCTT AGGTTCAGGT ATTGGTGTTA 42 0 

CACCTTTAGT AGCTATGTAT GAAGCTGCCT CTGCCAAAGG TTTAGATACA CAGATGGTTC 4 80 

AAGTTGCTGA AAATGAACAA CATTTACCTT TCAAAGACAA CTT CAACAGT ATCGCAAGCC 54 0 

50 

ATCATGACAA CG CTAAATT A TATACACACT TAnAgATAAA CAAGGCTATA TTGGTGCTGA 600 
AGAATTACAA GTATTTTTAG CAAATAAACC TGAAATTTAT ATCTGTGGTG GTACAAAATT 660 
55 CTTACAaTCT ATGATCGAAG CACTTAAATC TTTAAATTAC GATATGGATC GCGTACACTA 720 
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ATATGAAACT TAATCAACGT TACGTAAAAG TATTTGCATT ATATTTCGTA AGTATTGTTA 840 

CTGCAAATAT TATTGTTAAA AATAATAATT TAATTAAAAC AtTGATACAA ACCATAGCCG 900 

GGTACACGGT CTTTGCAGTT GGTTTGAAGT ATTTAACTAA ACGTAAAAAT AAATGACATC 960 

TATACCAAAA CAGCTAT GAC TTTACTTTGT CATGGCTGTT TTTTAATTGG GAGTAGGACA 1020 

GAGATGATAT TTTCGCAAAA TTTATTTGGT CGTCCCACCA CAACATGCAT TGATGTATGC 1080 

TCACTGAATT TCATAAGAAA GGAGTTCACA AGATGACCGT AGATATTGGA CGGATTTATG 1140 

ACAATAAAGA TAATACCGAC GCTATTCGTA TCCTAGTCGA TAGAGTCTGG CCGAGAGGTA 1200 

TTTCGAAAAG AACTG CTAAC CTAGATTATT GGTTAAAAGA CATTGCCCCT TCTACTGAGT 126 0 

TGCGACAATG GTTCCAACAT GATCCTAAAC TTTTTGGAGC TTTTAAAGAA AAATATGAAA 1320 

AAGAATTACG TGATCAGGAT GCGCAAAAAG ATGCTTTTGA AAAATTAAAG GATATTGTAA 13 80 

ATCAGCATAA TCATGTTCTA TTGTTATATG CAGCAAAAGA TACTAAACAT AACCAAGCTG 144 0 

TAGTACTACA GCAGTTGCTC AATACTTAGT TATATATGTT TATCGTGAAT CACTATAAAT 1500 

GTCGCTAACT TCATTTCTAA TATTTATTTT TAAAAAGCAT AATTACTACA ATTAATTGGA 1560 

ACTTTAATAA TTACTAAACT TGGAA CAATA TTTTACTTTA ACAAAATAAA GTTTTAAA 1618 
(2) INFORMATION FOR SEQ ID NO: 604: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2115 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 

ATTATAATGC AAATATTGAA GCATTTAAAA CAGTCGCTAA AGCAGTAGGC AAAGAGAAAG 60 

AAGGCGAGAA GCGTCTGGAA AAG CATGAT A AAATATTAGC GGAGATTAGA AAGAAAATTG 120 

AACAGAGTAC GTTAAAATCT GCATTTG CAT TCGGTATCTC AAGAGCAGGT ATGTTTATTA 180 

ATAATGAAGA TACATTTATG GGACAATTCT TAATTAAAAT GGGTATTCAA CCTGAaGtcA 24 0 

mAAAArAmAA AACTACGCAT GTTGGTGAAC GCAAGGGTGG TCCTTATATA TATTTAAATA 3 00 

ATGAAGAACT TGCCAATATC AATCCAAAAG TTATGATTTT AGCCACTGAC GGAAAAACGG 360 

ACAAAAATAG AACGAAATTC ATTGATCCTG CAGTTTGGAA ATCATTAAAA GCTGTGAAAG 42 0 

ATAACAAAGT TTATGACGTT GACCGAAATA AGTGGTTGAA ATCAAGGGGG ATTATCGCAA 4 80 

GTGAAAGTAT GGCAGAAGAT TTAGAAAAAA TTGCAGAAAA AGCAAAATAA AAATACAGCG 540 
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TGTGATGATT TACCTGAAAG TTTATTTCGA ATAAATTTAA TTACATAACC GACAAGGATT 660 

GTTTTAACAG TTCTTTTAAT GAATTGGCGC ATCGTTACAT ACCTCATTTC TCTATATCTT 720 

ACGAACTATA TACCCATTCA TATATGCTTT TTAAACGTCA TTGTCACAAT TTAATTTTTA 780 

GGGAATATAA TATAACCATC TTTATCTGCT TTTTTAGTAA AAATGACAAA AATTGCATGT 84 0 

ATTATTGaGA TGATGGTAGG GATACCTGTC CAGAAAAATA ATAAGTGAAA AAGACCTTGT 900 

CCAAATTTAT CAGCATAAAA TTTATGaATA CCTAAACCTC CAAGAAATAA TGCAACmATa 960 

ACATAAATGG CTTTATTGAC TTTCATTTGT AAtCCTCCTT AACTATAATT CTACTTAAAT 1020 

TCGTTGTGAA AACCAATATT TCTAACTTTA GAATTTTCAA ACTTTCTAAA ATTATAAGTA 10 80 

TATCTTTTTA AAATAAGCTA GAATTTCTAT ATAATAAATG TTAATAACGT AAAAGGGAAT 114 0 

GATGACATAG TGATACGTCA AGCACGTCCA GAGGACCGAT TTGATATTGC GAAGTTAGTT 1200 

TATATGGTTT GGGATGATAT GGAATTAGAA TTGGTAAAGC ATCTACCTAA AGACATGGTA 1260 

TTAGATGCAA TTGAAAAAAG CTGTGTTGAT GCAACATATC GAACTTTTTA TCAGCATATT 13 20 

TTAGTTTATG AAGTAGAAAA TAAAGTAGCA GGTTGTATTA TTAGCTATAG TGGTGAAAAT 13 80 

GAATTGAAAT ACGAAAAAGC ATGGGAACTA CTTGACTTGC CAGAAAAAAT AAAACAATAT 144 0 

GGCACGCCAT TACCTGTAAA AG AAG CT AAA GACGATGAGT ATTATATAGA AACAATTGCG 1500 

ACATTTG CAG CATATAGAGG TAGAGGCATC GCGACAAAGT TATTAACGTC ATTACTTGAA 156 0 

TCAAATACAC ATGTTAAATG GAGTTTGAAT TGCGATATTA ATAATGAAGC AGCATTAAAG 162 0 

TTATATAAAA AAGTAGGCTT TATATCTGAT GGACAGATTG AATTATACAA GCACATGTAT 16 80 

CATCATTTAA TTGTTAAATA AAATACTCGA CAGTTCGATG TAAGTCGATT GCCGAGTAGT 174 0 

ATCATTTTCT ATTAAATGCC TGCAAATAAT GCACTAATAT AAATACCTAA TGCATATAAT 1800 

AAACCGAAAA ATGTATTTGT TTTACCAGCA GCAGCCATTG CTGGCATCAT TGTAGGCGGT 18 60 

GTATCATTCT TCTTGAAACG TCTGATAACT TTAACAGGCA TTGGGAATGA TAACAACGCA 1920 

AGTAAGTAAA ATAATGAGCC AC CAGGTTT A ATAATGATCG TAAGTACAAT AAAGGCATAA 1980 

GCGATAAAGT ACATGATTGC CATAAATGTT AAAG AAG CAT TTTTACCTAA TAGAATGGGT 204 0 

AAAGTTTTGC GACCACTTGC TTTATCTTTG ACACGGTCGC GAATATTGTT AGC CATATT A 2100 

ATTAAACCGA TAGTG 2115 
(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

TATGTCTCAA ACTCAAATTA ATCAAATGTT CCAGCAGAAA AACATGTCTA CTGAATTAAA 60 

ACGTCGTTAT GCACAACGTT TATTACAGTT TCCACATGTA CACAATAAAG AATACTTGAA 120 

ATCTTATGCT AAAAACCCTA AAGAAACTAA AGATAGTTAT ATTTCTGGTT TTAAAGAGAA 180 

TCAATTGATT AAAATAGAAG CGATTAAATC ATTGTTTGCA ATGGATAAAT CTCCATTAGA 240 

ACATGTTAAA CCTGCTACAA AACCAGACGC TTCTTGGGAT GAGATGAAAC AAAAAGCAGT 3 00 

TGAAATTGGT AAAGCTGATA CTACATCGAA TAAATTTGGT ATTAGAGATC AATACTGGAA 3 60 

ATTAATTCAA GAAAGTAAGC GTAAAGTTAG ACGTGACTAC GAATTCAATG TTAATTCTCC 4 20 

AGAATTCCAA GATTTAGAAT TACTTGTAAA AACAATGCGT GCTGCTGGTG CAGATGTTCA 4 80 

ATATGTAAGT ATT CCATCAA ACGGTGTATG GTATGACCAC ATTGGTATCG ATAAAGAACG 54 0 

TCGTCAAGCA GTTTATAAAA AAATCCATTC TACTGTTGTA GATAATGGTG GTAAAATTTA 600 

CGATATGACT GATAAAGATT ATGAAAAATA TGTTATCAGT GATGCCGTAC ACATCGGTTG 660 

GAAaGGTtGG GTTTATATGG ATGAGCmAAT TGCGAAACAT ATGAAAGGTG AACCACAACC 72 0 

TGAAGTAGAT AAACCTAAAA nTTAAAATAC AAATAGCACA TAACTCAACG ATTTTGATTG 7 80 

AGCGTATGTG CTATTTTTAT ATTTTAAATT TCATAGAATA GAATAGTAAT ATGTGCTTGG 84 0 

ATATGTGGCA ATAATAAAAT AATTAAT CAG ATAAaTAGTA TAAAATAACT TTCCCATCAG 90 0 

TCCAATTTGA CAGCGAAAAA AGACAGGTAA TAACTGATTA TAAATAATTC AGTATTCCTG 96 0 

TCTTTGTTGT TATTCATAAT ATGTTCTGTT AACTTAATAT CTT 1003 
(2) INFORMATION FOR SEQ ID NO: 606: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 606: 

GCTnTTATAT TTAAAAAATT TATTnGCGGA TAATTACCTT TATTAAATCC CACCCATTAG 6 0 

GAwTaCGAGT AATAGGACTA ATACTGGAAT CACATAATGT AACATAACGT CCCTCCTTTA 12 0 

ACTTAATTTT AATTGTAATC AAATTTGACA ATAAGTCAAA mCATTAATAC CTATGATArG 180 

TATCATTTAT TAA CATATGT ATCATATTTT TAATCTTGCG TAATTTTTAT CGTTAACTAT 24 0 
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CTTTTGTCAT TTTAATTATG TTAAGATAAT AGTAGTTATA GAAGTTCAAT CTATAGGAGG 360 

CATAGCATGG ATATTCCAAA AATCACGACA TTTTTAATGT TTAATAACCA AGCTGAAGAA 4 20 

GCTGTTAAAC TATACACAAG CTTATTTGAA GATAGTGAGA TTATAACAAT GGCTAAGTAT 4 80 

GGTGAAAATG GACCTGGTGA TCCCGGGACT GTACAACACT CAATATTTAC ATTAAATGGA 54 0 

CAAGTATTCA TGGCGATTGA TGCTAATAGT GGCACAGAAT TACCAATGAA TCCTGCGATT 600 

TCATTATTTG TTACAGTAAA AGATACTATT GAAATGGAAC GACTATTTAA TGGATTAAAA 6 60 

GATGAAGGTG CCATTTTAAT GCCAAAAACG AATATGCCAC CATACAGAGA GTTTGCTTGG 720 

GTTCAAGATA AGTTTGGAGT AAGTTTTCAA TTAGCATTAC CTGaGTAAAA GGATTCGCAC 7 80 

AGCtTGgAAT GATAAAGAAA CACTTTTTCT TATGCATGCG TTTTACCTAT GTATTTCTAT 84 0 

TTTAAGTACA CATTAGCATT TTAGTTTCGC TCATTTTTAA AATCACAAAG ATAATTTGTA 900 

ACTTAAATGA TGATACGTTA TTTAAAAACA CGATACTTCG TTTCAATGAA CGCATTAAAT 960 

AATAAATAAA CACCTCACCA TAAGAAAGGA CTACTTTCTT TTGTGAGGTG TTTatTTGTC 1020 

GACAACTTGA TTATGATTGT TTCATTTTTT GAATAAGTTC ATAATCAGGT GTAGCATACA 1080 

AAGTTTTTTG ATTGTCA 1097 
(2) INFORMATION FOR SEQ ID NO: 6 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2031 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 

GAGGTCTAGC ATGTCTCGTT CAAAAAAATA CTTTTACTTA TCTAGCTTAA TGATTATTTT 60 

AAGCTTTTTC TTTAATACAA ATAACGTTTT CCTAAGTGGA CTTTTTAATT CTTTTATTAA 120 

ATTAATACTT TTCTGCAGTG TTATTAACTC AATTGTACTA ATTTTGTCTA TAATTTTTGC 180 

AGATCGTTCA ATTAAATCAC TAAAGCCTGA TGCAGATTGG ATTAGAATTG CGAGTAAAAG 240 

TTTGCCTTGG aTTATTCTAA TTGTTATTTT AGTACATATC TTTTCAATTG TTCGTACATT 300 

CGGTTTTATT TAAAAAAGTT AATATGTCAT TGTAGCCTTA ATACAAAACA ATACAATGTA 360 

TCATGCTATA ATGAGTAAAA CAATTTGATA ACGTTGTTGC GTATAAAAAA TATTAGATTT 420 

TCGAAATCAT AACTATGCAT CTAATCGCTA TAGTTATACA ACAAGATATA ACATATAATG 4 80 

AGGTTTGATA ATGCATCGAC AATTTTTGTC GTCGCGTTGC CAAAACCTCT TTTTTAAATT 54 0 
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TTTAGGCGCT TTATTACGTT ACCTCATTTC TTTTCTGAAT ACTGACGGAG GTTTTCCAAT 66 0 

CGGAACACTG ATAGCCAATT TGACTGGTGC CTTTGTAATG GGATTGCTAA CAGCCTTAAC 72 0 

AATTGCATTT TTTTCAAACC ATCCGACCCT AAAAAAAGCT ATTACGACTG G T T TTCTT GG 780 

TGCTTTAACG ACTTTTTCAA CATTTCAATT AGAATTAATA CATATGTTTG ATCATCAACA 84 0 

ATTTATAACT TTACTACTAT ATGCTGTAAC AAGTTATGTC TTTGGTATTT TGTTATGTTA 900 

CGTCGGTATA AAACTAGGTG GTGGTTTATC ATGATATCAA TCATTTTAGT CATGATTGGC 96 0 

GGCGGTTTTG GCGCAATTGC TAGAAGTGCC ATTACTGATT ATTTTAATCA TAAATTTACT 102 0 

TCAAAGTTAC CTATCGCAAC ATTGATAGTA AATCTAGTTG GTAGTTTTTT AATTGGATTA 108 0 

ACTATAGGCT TATCAATTTC AATCTCATGG TTCCCTGCGT TCTTTGTTAC CGGTTTTTTA 1140 

GGTGGCTTAA CAACTTTCTC AACGTTAGCC AAAGAACTTA CACTAATGAT GACGCCAAAA 1200 

TTTAATATTA ACCTTTTTCT CAATTATTCA CTTTTACAAT TCAT CATTGG ATTTATAGCT 126 0 

TGTTATATTG GCTATCATAT TTAAAAATAA AATGCTTCAT TCAG CAAATA GGTAAATTAC 1320 

GACACCTTCC TGAACGAAGC ATTTTTTAAT TTTCATGCAA ATTTTTAAGC ACCATATAAT 1380 

GCCTACCAAA TTTCAATAAT CTTTGTTGCC GTTTAAATAA TGTGAATGTC AATAAATTCT 144 0 

CCAAACTAGT CGAAAATAAA GGGAGTGGGA CATAAATCCC TAAAAAAACA GCAGTAAGAT 1500 

AATTTTCAAT TAGAAAATAT CTTACTGCTG TTCTCTATTT ATACAATACT TCGTATTGAa 1560 

TGGcTTCGCT TTCCTAGGGT GCCGTCTCAG CCTCGGcTTC GACTGGCACT GCTCCCTCAG 162 0 

GAGTCTCGCC ATTAATAcTA CGTATTAACA TGTAATTTTA CTTTTACATA CTTTAAAAAA 16 8 0 

TAAGACACTT TGCCCAACTT ACACTACCAA TAGAAACCTC TGTTAGAATT CCTCAAAATG 174 0 

ATATTTCGCG ATATGTTAAT GAAATTGTTG AAACGATACC TGATAGCGAA TTCGATGAAT 1800 

T CAGA CAT CA TCGTGGCGCA ACATCCTATC ATCCAAAAAT GATGTTAAAA ATCATCTTAT 186 0 

ATGCATATAC TCAATCTGTA TTTTCTGGTC GAAGAATAGA GAAATTACTT CATGACAGTA 192 0 

TTCGAATGAT GTGGTTAGCT CAAAATCAAA CACCTTCTTA TAAAACTATT AATCGTTTTA 198 0 

GAGTGAATCC TAATACTGAT GCGTTAATTG AATCTTTATT TATTCAGTTC C 2 031 
(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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TCCCCGGGAA TCGAACCCGC GAnCTCCTGC GTGACATGCA GGCGTGTTAA CCGCTACACT 60 

ACGAGACCTA TTAnATTAAA AACTATGTAT TGCGGGAGGC GGATTTGAAC CACCGACCTT 12 0 

5 

CGGGTTATGA GCCCGACGAG CTACCGAACT GCTCCATCCC GCGCTAATAT TATTTTGAAT 180 

TACCTAATTA ATATACCATA ATCAAAAACC TAAAGTsrrG AACTTTTTGA ATTTAATTTA 24 0 

tg AATGTTATCT CTTAAATAAT TACTTAAATA TCGTAGCAAC ATGTTCTCTG TTGAACACAA 30 0 

ATATTAGTAT ATTCATTTTT GTAGTGTACG TCAACGACAT TTTCAAAGTT TTTTGTGTAA 36 0 

AAAAACGCTT CTTATTCCCT TTTAT CATAT AAGTGTCTAA TAGTTGTCAT AAATAGTGkT 420 

15 AAAGCATTTA AAAAGGTATA GGAGTTATAA AGTTTACAAC GCCTATACCT TCTGAAAAAG 48 0 

AAATTATAAA AGCTTGTTAC ACCGCATATT CTTTCAGTCA GCGACTACCA ATATAACATT 540 

GTAGCCCtAA GACATTGCTT GACGCCTCAn TTACAACAAT TTTTCAAAAT CAGCAGCTAC 60 0 

20 

CTACTGACAC AACATAACAC AACCCnTACA CTAnCTATCG TGTCATGTAA TCTTGCATCC 66 0 

GATCTTGCAA CGCTGTAAAT GTTTCGA 687 
2s (2) INFORMATION FOR SEQ ID NO: 6 09: 

• (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609: 

35 GTGTTGAATA CTTTGACAAA ATGAAATCAT TTG TTCTTTC GTTTCAAATT TAACTGTTTG 60 

AATTAAATCA GTTCGTTTTT CGTAGTACTT CGGCGTTGTG TTCATATTCA TTTTTTCTAA 120 

AAATAAACTA GTAAACAATG CACCTTTAAG ACTCTGACTG ACAACGTGTG GTG CTAAAAA 18 0 

40 

GAAACCTTGA TACATTTCAA GCAATGCATT TAATGATGCA CCCGCTTCTT TACCAATACC 24 0 

AGGTGCTGTC AATCTATAAC CACATCGTTC AATTAAATCT TTTCTACCAG CAATGTATCC 30 0 

45 ACCAATCTTA GCTAAACCAC CGC CAGGGTT TTTAATTAAT GATCCTGCTA TTAAATCGGC 36 0 

ACCACATTCT ATAGGTTCAC GTCTTTCAAC AAATTCCCCA TAACAGTTAT C CACAAATAT 420 

TAAAATATTA GGATGCACGT TTTTCAACCT AGTAATTACC TTTTCAATTT CATCTAGCGG 480 

so 

AATTGAAGGT CTTTGATCAT AGCCTTTCGA ACGTTGAATC GCTATTACTT TGGTGCGCTC 54 0 

AGAAACCCCa TCTAACACAC TTTCAATATC GATCTTACCT TCTTTAAGTG CAATATCTTT 600 

ATACGATmCG CCATGCTCCA TTAAACTTTC AATAC CATTT CCGTTTACGC CAATGACTTC 660 
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ACTTTGTAAT GCAATAGTAA TCGCATGCGT ACCTGAAATA ATTTG CGGAC GAACAATTGC 780 
ATCTTCTGCT TTAAATGCCT GCGCATATAT TTCTTAATTG AACGAATTTA GAACTAATCG 84 0 

TAA 84 3 

(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 

GATTATTATA AACCGAATGA AACAAAAAAG TTTTTACATA TTTTCAACAA CTTTATTTCA 60 

ATAAAGCATC AAGTTGATCT AAAGTTTGAT TCATACCTTG TTCAACACCC ATGTTTATGA 120 

CTTGTTGAGC GGCTTCTTTT GTTGGAAATA CCGATGTTGA TGTCACTGTC GTTTTCGTTG 180 

TATTGCTCTT GGAAAATGAC AAAGTAATTT TCATGCTTGG CATTTTTGTA TCTTTTGCAC 240 

CTTGAGGTGT CGCAAATGAG TCAATATATT CAATTAAATA TGGACGCTTA ACCGTTTTAT 300 

ATTCTGCTAA TGTATAGCTG GTCATTGTAG GTGTTTTAAT CGCATAAAAT GCATCACCGC 360 

CTGAAACAGC ATTAAAGCGA AACACTTTGG TACTAGCGTC TTTTGGAtGA AACCATTTTT 42 0 

CAAATAATGC TTTkGTCGTA TAGGCATCAA ACACCTTTTC AATCGGTGCT TCTATCGTTC 4 80 

TTGAAAAAAT AATTTTATTA TCTTCAACTT TAATCGTCAT CTTCCCACTC CCTATCTTTT 54 0 

ATATTTACTC ATCTTAACAT GCATTGAGTA ACATATGTTA CCAAATCATT TGTTATAAAC 600 

TATAGTAATA TTAATCATTC CCTCTGTnGA AATTGATAAC TACGAAATAA AAAACACACT 660 

CTATTCAGCT ATTAACAGTT GAGTAGAGTG TGCAAACCTG TTGAATTAAT GCGCTAACAT 720 

TTCTTCTTTG ATTTGGTCTT nATnTAATTT TGAAGGATAA TATGTTGGCC AGTTA 77 5 
(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 8 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 
ATCAAAATTT GATACTAAAG GGCTCACGCT ATTTTATGTA TTTATTGGAT TGATTATGTT 60 
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TGTGGCAATG TGTTTATTTA AAGTAGAAAA ACATGTTTCT TCACCATTTT TACCTGTGGT 180 

TGAATTTAAT CGTTCGATTA CTTTAGTTTT TATAACTGAC CTTCTAACAG CTATTTGTTT 24 0 

AATGGGATTC AATTTATATA TTCCAGTCTA CCTTCAAGAA CAACTAGGAT TATCTCCATT 300 

GCAAAGTGGA TTGGTTATTT TTCCTTTATC TGTAGCTTGG ATT A CATTG A ATTTTAATTT 360 

ACaTCGAATT GAAGCAAAAC TATCAAGGAA AGTTATT T AC TTACTATCAT TTACATTGCT 420 

ACTAGTAAGT AGTATTATCA TTTCATTTGG TATTAAATTG CCGGTACTTA TAGCATTTGT 48 0 

GTTAATTTTG GCAGGATTAA GTTTTGGATA TATTTATACG AAAGATAGTG TGATTGTCCA 540 

AGAGGAAACT AGCCCATTAC AAATGAAGAA AATGATGTCA TTTTATGGAT TAACTAAAAA 600 

TCTTGGGGCA TCAATAGGTT CAACAATAAT GGGATATCTT TATGCGATAC AATCAGGAAT 660 

CTTTGGTCCA AACTTACACA nTGTGTTAAG TGCTGTTGCT G TAATTAG CA TGGGCTTATn 720 

GTTTTATG 728 
(2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 612: 

TATCAGAGCA AATAAGCAAG TTAGATAAAA AGATTTCTGA CTTACAATTA ATTAGAC GCT 60 

CTGTATGTGA ATTTATTAAA GGACTCTCTC TAATAGATAC CAGCATTTTA AACAAGACAC 120 

TACAGTCACA ATATGATAAA GAAGCATCTA TAAAATATGG TCATACGAAA GCATATCAnT 180 

CATTTATTAG ACGTAAAGAC AGCTTACAAT CGCAGGATAT CAGACATAAA TTGACAACTA 240 

TCTTCAATAA ATTTAATCAT ATGTCTTTGA GTCATTATCC AATCCAAGAT TGTAGTGATC 3 00 

TCGTATTTGA GTGGAAGGCA TTTATGAACA CTATCGCTGA TTTTGmTGAT GAAACATTAT 360 

GCTGTATTGC TAAAACATAT GAAGATGATA CGCGTTTCAA AGATTACTTT AATTCATATG 420 

ATAATCAAAA TTTAGCATCA TACATTTCAG AAGCTGTTAA TTATTTTTTG AGCAATGTGA 4 80 

ATAAGAG CG A CAATTTTTAA TCCTCATACA GATGCTACAA CACTGAATAC ACCAACTAAA 540 

ATAATTGTTC AAAGTAGAAA ACTAATCATT TATTCTCTAA ATCAATAACT ATATTAAAAG 6 00 

TTATACCTTT GCAAAGCGAA TTAGTATAGG TTACCGAAAG GAGAAAGGAT TAGGTTCCAT 660 

TCGATTTATA AAAAAATATA TTTCGAAACA TACAAAAGCG CCAGAAGATA ATCATTT CCT 720 
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TCAATGCATA TTCAACTAGC TCAGGTTTAC TCTTTAAGCC AAGCTTTGTC ATAATATGCG 84 0 

TCTTATGTGC TTCTACTGTT TTCACAGATA CAAATAATTT CTCTGCAATT tCTTTATTCC 900 

CGTAACCTTT GGC 913 

(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 
ACAAGTGTTG ATGTCATAGT AGCGTCAAwT GTGCTTTATT TCTGGGACAC ATTTAAATTT 60 

TTCCACCCCA TTACTTGGAG CTTAATCTTT ATTACAATTT TACT ATT ATT AAACATTTTT 120 

TCTGTAAAAT CATTTGGAGA AACTGAGTTT TGGTTATCAT TGATTAAAGT GTTAACAATT 180 

ATCGTATTCG TTATTTTTGG CTTTTTAATG ATTTTCGGTA TCTTAGGTGG TCATACATAT 24 0 

GGATTTGAAA ACTATACAAA AGGCCAAGCA CCGTTTGTTG GTGGTATCTC TGGTTTCTTA 3 00 

GGCGTATTAT TAGTCGCCGG ATTTTCGGTT GGTGGTACAG AAGTAGTAGC AGTAACTGCT 3 60 

GGTGAATCAG ATGACCCTAA AAAGTCTATG CCTAAGGCAA TTAAACAAGT ATTTTGGCGT 4 20 

ATTCTTTTAT TCTATGTCTT ATCAATTGCA GTAATTGGTG CAATTATTCC GTACACAGAT 4 80 

CCATCATTAT TAAGAGCAAG TAGTTCAATA AGTCAAAGCC CATTTACAAT TGTATTCGAT 540 

AGAGtAGgCA TAGCCTTTGC AGCATCAGTA ATCAACG CGG TTATTTTAAC TTCATTATTA 600 

TCCGCTGCAA ATTCAGGTGT TTATACAACA GGCAGAATGT TGTATTCCTT AAGT 654 
(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 

TCGATCTTTA TTGATAATAA TTAAATTGTC GCCTTTAAAA TGTGATATTA ATCCTGCGGC 60 

AGGnTGTACA ACGAGTGATG AACCTAGTAC AACAAGGGTG TCAGCATGTT CAATTTTATT 120 

TAATGCCCTT ATGATGGTAG GTTGATCTAA CATTTCACCG TATAATACGA TGTCCGGTCG 180 
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ATAAGATTTA TGACATACAT TACAATAAAA ACGATTTAAC GTGCCATGTA ATTCATCAAC 300 

ATGTTGACTT CCAGCGTCTG AGTGCAAACC ATCGATATTT TGCGTGATGA CACCTAAAGA 3 60 

5 

TTGTTGATTA CGTTCTAATT TTGCAATCCA ATCATGAACG ATATTGGGCA TCGTATCGAC 420 

AAATAGTAAG CGCTTATGGC AGAAATTGAT AAAACCTTCA GGATCATCTT CTAAATAATC 480 

ACGGCTTAAC AAGTATT CTG GCGAAAGCCC ATCTTTTGaA ATTTCATCAA ATAAGCCACC 54 0 

10 

CATTGaACGG AAATCTGGAA CGCCACTTGC GACAGATACA CCAGCACCTG TAAAAAATGT 60 0 

AATACGATTC GAACTATCTA TAATATGTTT TAGTGTCTCT AA 64 2 
'5 (2) INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1210 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 

TTTCCCCnCC CCnCCAAATA TCCAAnGGAA CTTTAATAGT CCAATTGGCA CAGTAAAACT 60 

ATGGCATTTT ATAAGTATAA TATATCTGTA ATTTATGGTC AATTAGTAAA TTGTTTTTTA 120 

TTTGAAACAT ATTTCACATc AAAATCACAA AGACTTTTAG ATTTTGtTCT AAAAATCTCT 180 

TAATAATTTA TTTAATGAGA AG AGTTG CTT ATATAGTAAA TTGTGAAGCC GTTAAAACAA 24 0 

CGTTACAAAA CCTATATCTT TAATACGGAA CCATATGGTA TGAATCAAGG AATACTTAAA 300 

CTAAAACTTC TCTATCAGAT TTATTTGTTG CGAAATCAAC AACTTTAATT GCTTGCCCTT 360 

CATTTAATGG ATAATTTGCT TGCGTAATTT TAACTTTTAC AATTTGACCT ATGAGTGATT 42 0 

CGTCACCTTC AAATTGTACT TTCATATAAT TATCTGCATA TCCAACTAAT GTACCTTCTG 480 

TGtCACCCTG TTCCTCAGGA ATTACTTCAA GCACATCTTG ATCAAATTTA GACGCATATA 54 0 

ACTTTCCGAG TTGATTGCTT AGCGTAATTA ACTTATGCAC CCGTTCATTT TTAATTTCTT 600 

CATCAATTTG GTCATCCATT CTTGCAGCTG GCGTGCCAAT TCTAGGAGAA TAAGGGAAAA 660 

CATG CAGTTC AG AG AA CTT A TGCTTTACGA TAAAATCATA TGTTTCTTGG AACTCAG CTT 720 

CAGTTTCACC TGGGAAACCA ACAATTACAT CACTCGTAAy TGCCAAGTCT GGTAAAGCTT 7 80 

TATGCAATTT TGTTAATCGT TCTGAAAATC TATCCATTGT ATACTTACGT CTCATACGTT 840 

TTAATACTGT ATCTGAACCA GATTGTAATG GAATATGCAA ATGACGCACA ACTTTTGTTG 900 

AACGTTCTAA AACGTCAATT ACTTCATCTG TAAGTTGACT TGCTTCAATT GAAGAAATTC 960 
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CTTTTAAATC TTGACCATAT CCACCTGTAT GAATTCCCGT CAATACAATT TCCTTATATC 10 80 

CTGAATTCAC TAGTTGCGTC GCTTGTTCAA CTACTTTTTC CGGATCTCTT GAACGCATTA 114 0 

AGCCACGAGC CCATGGAATA ATAcAGAATG TGCAGAAGTT GTTACAACCT TCTTGAATTT 12 00 

TTAATGACGC 1210 



(2) INFORMATION FOR SEQ ID NO: 616: 

to 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
is (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 

20 TAATAAAATA CAATACTTTT CAATACAGAG AATCCCG CAA TGTCGGGATT CTTATTTATG 60 

CTGATTTTGT TTTTGTCTAT GATCAGGGAC TTtCAGGGAC TCAATTAATT AT CA CATTCA 120 

TATTTTCTTA TGACCAAATT GATACATATG ATGCTATTTA ATGAGAAAAT TAGGCATCAC 180 

25 

TTGGTTATTG AATTTCCTTC ATTAAACTTT CCAGCTCAAT TTAATAGTTA GTCGACTATT 24 0 

ATTCATTAAA CACTTTTTAA TCATAAAAAA GTGTTTTTGa TAATTCACTa CCaAAAACAC 300 

30 CTTCTTACTT ATAATTCTAT TTGTTCACCA TTTCTAATCT TATCGGCTAA ATCATTCAGT 3 60 

TTTCTTAATC GGTGATTTAC ACCTGATTTT GAAATTGGAC CAGTTGATAC CATTTCTCCA 4 20 

AG CTCTTTCA ACGAAATTTC TTGATGTTCT ACTCGAATTC TAGCAATCTC TCTCAACCTG 4 80 

35 TCTGGTAAAT TTTCAATACC AATTTCTTTA TCAATCAATT TAATGCTCTC AACTTGTTTC 54 0 

ATCGCAGCAC TAACTGTTTT ATTTAGATTG GcCGTTTCAC AATTAACGAG TCGGTTAACA 6 00 

GAATTACGCA TAT CTCTTAC AATAC GTACG TCTTCAAATT TTAATAACGC TT 6 52 

40 

(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 8 base pairs 
4S (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 

TnACCACTTT AAAATAGCGC TTAATAAAtG AAGGGGGCAA GTCTTATGAC GtTTTACaAT 60 
TTCATCaTGG GTTTTCAAAA TGrTAACACA CCATTTGGTA TATTGGCCGa ACACGTTAGT 120 

55 
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TCTAATTACA CAGATCATCA ATTAATTGAA ACTACAAATA GAGCTATTAG CTTATATATG 24 0 

GCAAATTAAT TTGAGTAGTA CCAATTATGA TGTATTAGTG CATCCCAAAT ATCTTTTGTT 3 00 

TTAAAGTTTA TTTCATCATT TCTTATCGAA AATGGTGTAA TAATGTCTTT ATCTAACCAA 3 60 

GTGTTGATAA GTTCATTTGG TACACCATCT AACAACATTT CACTTTTACT AATTATAAAA 420 

CATTCCCAGT CAAGTGAAAC ATTTTGTGGA TTCACATAAT TACATTGATT ATGATTATCC 4 80 

ATAAACACTC ACTCCTTTAA AATTCTGTAC TCTTCATTGC GTTTTACCCC GTCACATTAT 54 0 

CTTTTAAACT AAAATCATCA TTACTTATGA AAAAAATGTA CATCAAAAGC AAAGGTTTTC 600 

GCTACCGAAA AAGTTTAAAT AATGGTttAA TATATttGGT ACTCATTTTA ATAAAAAGAG 6 60 

AATACATTTT GAGCTATCAA TACTTTTTAT TGAAGAGGTG TTATTsyTGG CTAAAACGTT 720 

ATATTTAATG CGCCACGGAC AAACTTTGTT TAATTTTAAG GGACTAATTc AGGGATTTGG 7 80 

AGATTCGCCG CTAACAGA 7 98 
(2) INFORMATION FOR SEQ ID NO: 618: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 
ATACGCTCAA TTGATAAAAA TTTAATATCT GCCATTTGTA TACGCATCGC TTCGAATGTT 60 

TCCGTTGCAA TATCAAATGA TAATTGTGAC TGGAACCTTA AACATCGAAT CATACGTAAA 120 

GCATCTTCTT GGAATCGTTC CTCAGCTATA CCTACAGTTC TTATTATTCG ATTATTAATA 180 

TCTTGTTGAC CATCAAAATA ATCATACAAT TTGTATGCTG TATCCATTGC TATCGCATTC 240 

ATCGTGAAAT CTCGTCGTTG CAAATCTTCG TATAAATCAC GAACAAATGT AACACCACTT 3 00 

GGTCTACGGT GATCGACATA ATCTTCTTCA GCCCGGAATG TTGTCACTTC ATAATTTt CA 3 60 

TCATtAAAaa CTACATTtAT CGTGCCAtGT y CTTnAC CTA CAGGTATCGT ATGACTAAAG 4 20 

ATAGATTCTA TTTCATCCGG CGTTGCACTT GTTGTGATAT CTATATCATG AATATTTCTT 4 80 

CCCATGACAT AATCTCTTAC AGAGCCACCT ACATAATATG CTTCAAAACC ATTGTCTTGA 540 

ATTTGTTCTA ATATAGGCCT TGCCTGTTCA AATAATGATT TATCCATATT ATTACTCGCC 600 

TTTACTTTTG TTATG CTCAT TTAGCATTTT TTGATAATAA TACTCATATT GATCTGTAAT 660 

AAGTTCTGAT CCAAAACGTT CAG CAATATC TGCTAGCATG TTTTTCTGAA GTTTGTTGTA 720 
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ATCTACGACA AATCCAGTTT CAC CATGTTT AATAACCTCT TTAATTCCAC CGGCATTTGA 840 

ACCAATTGGA ACGACGCCTG TwTTCATAGC CTCAAGTAAA GTTAGTCCAA AGCTTTCTTT 9 00 

5 

TTCACTTAAT AATAATACTA AGTCAGATAA TTGGTAAAAT TCACTTACGC AATCTTGTTT 960 

CCCTAAAAAT AAAACATCCT CTTCTACGTT TAACTCTTTC GTCAATTGAC GCATTGGCAC 1020 

w TAATTCAGGA CCATCTCCAA GTAAAATTAA TTTACTAGGT ATCTTTTCAC GTACTTTTGC 1080 

AAATGTTTCT ATAATAGTAT CTATGCGTTT TACTTGTCTA AAATTCGATA CATGTATTAA 114 0 

CACTTTTTCA TCTGGTGCTA TACCAAATTG TGATTTTAAT GCTGTGTTAT GTTTAGTTGG 1200 

15 AAACTCATTT TCACGTACAA AATTATAAAT CGGTATAATT TCTTTGTTAG TTTCAATAAT 126 0 

TTCATGTGTT TCTTGTGCTA AAGATTTACT CACACTTGTC ACAATATCAC TTTTTTCAAT 13 20 

GCCAAATTTA ATTGCACCTT GGAGTGAATG ATCATAGCCC AAAACAGTAA TATCAGTACC 13 80 

20 

GTGTAGCGTT GTCATAATTT TTATATCTTT ACCTGACATC TCACGAGCTA AAATCCCACA 144 0 

AATTGCATGA GGTACAGCAT AGTGCATATG CAACAAATCA AGATCATATT CTTTAATAAC 15 00 

2s TTCAGCGATT TTAGTACTTA ACGTAATATC ATACGG TGG A TACTGAAATA CTGCATATTG 1560 

ATTCACTTCA ACTTGATGAA AAATCATATT CGGTAATGGT TTTCTTATTC TAAACGGGAT 16 20 

ATTTGAAGTG ATAAAATGTA CTTCGTGACC TCGCTCTGCT AATTTAATTC CTAATTCTGT 16 8 0 

30 GGCAATAATT CCAGAACCAC CCATGGACGG GTAACATGTT ATACCTATCn TCATTCGCTT 174 0 

GCCCATCCTT TCTTTCTATT TCTCnCTATG ATnCTCGATG CGTAGA 1786 
(2) INFORMATION FOR SEQ ID NO: 619: 

35 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 

45 ACAGGTATGG ACTTnGCTCA AATGACACGA CATTATTTAT CAAGACCTAT TG CTATAATC 60 

TTTTGGATCA TTGCAGAACT AGCAATTATC GCTACAGATA TTGCTGAGGT TATTGGTAGT 12 0 

GCTATTGCTC TTAATCTCCT ATTTAACATA CCTTTAATCG TCGGTGCACT AATAACTGTA 180 

50 

CTTGATGTAT TTTTACTACT TTTTATAATG AAATATGGTT TTAGAAAAAT TGAAGCTATT 24 0 

GTTGGTACAT TAATTTTCAC AGTGTTATTC ATCTTTATAT TTGAAGTCTA TATTTCATCA 30 0 

CCACAGTTGA ATGCTGTGTT AAATGGATTT ATACCACATA GTGAAAT CAT TACAAATAAC 360 

55 



1547 



EP 0 786 519 A2 

TTACATTCAT CAATTGTACA ATCTAGAACA TACTCAAGAC ATAACAATGA AGAAAAAGCG 4 80 

CAAGGaTTAA ATTTGCTACG ATAGATTCGA ACATTCAGTT ATCAATCGCA TTTGTAGTCA 540 

ATTGCTTATT ATTAGTGtTA GGAGCATCAC TATTTTkCAA CTCAAATGCT GACGATTTAG 600 

GTGGTTTCTA TGATTTATAT CACGCCTTAA AAACTGAACC TGTACTAGGT GCAACAATGG 660 

GTGCAATCAT GAGTACATTA TTTGCAGTTG CATTATTAGC GTCAGGTCAA AATTCAACGA 720 

TTACTGGTAC TTTAGCAGGA CAAATTGTAA TGGAAGGATT TTTAAGATTA CACATACCAA 780 

ATTGGTTAAG ACGTTTAATT ACACGTTCTC TTGCTGTCAT TCCTGTTATC GTATGCTTAA 84 0 

TCAT 84 4 
(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 

TGTATTGCAG TCATGCCATA AATTTTTTGA AT AATAAA CG GAGACGCAGA AATATAAGTA 60 

AATAATATTA CAAATGTCAT AC CTTGGAAT GAGCATTGGT AATACAAAAC GTGGCGTCTT 120 

CAnTAATATT TTGAAGTTTT TAAACATTGT CTTTAaTCCA CTACTTGACT CACGATTTGT 180 

CACTGTTAAT GATTCAGGTA CTTTTAATAA AGAACCTATG ACCATGACGA AGCCAAAGAT 24 0 

AGTCAGAATG ACAAAGACCA TACGCCAGAC AGAATAATTT AAAATTATGC CCCCTATTGT 300 

TGGCGCAACA ACTGGTGCAA TACCATTAAC AAGCATCAAT AATGCCATAA ACTTAGTTAG 3 60 

TTCATTACCA CTATACATAT CACTTGCTAT AGCTCTTGAA ATAACTGCTG tGCGCCACCT 420 

GTCACTCCTT GAAGaAATCT TAATGCAACC ATCAGCCAAA TATTATGTAC AAAAACAATA 480 

CCTAAACTTG CTAATGTAAA AATAATCATA GCTATAATAA GCGGCTTTCT GCGCCCAGTT 540 

GAATCTGAAA TTGGACnAGC AACCAAATTA CCAA 574 
(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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AAGCAAGAGA GAGTACAACA TTTATATGAT ATTAAAGACT TACATCGATA CTACTCATCA 60 

GAAAGTTTTG AATTCAGTAA TATTAGTGGT AAGGTTGAAA ATTATAACGG TTCTAACGTT 120 

GTACGCTTTA ACCAAGAAAA TCAAAATCAC CAATTATTCT TATTAGGTAA AGATAAAGAG 18 0 

AAATATAAAG AAGGCATTGA AGGCAAAGAT GTCTTTGTGG TAAAAGAATT AATTGATCCA 24 0 

AACGGTAGAT TATCTACTGT TGGTGGTGTG ACTAAGAAAA ATAACAAATC TTCTGAAACT 300 

AATACACATT TATTTGTTAA TAAAGTGTAT GGCGGAAATT TAGATGCATC AATTGACTCA 360 

TTTTCAATTA ATAAAGAAGA AGTTTCACTG AAAGAACTTG aTTTCAAAAT TAGACAACAT 42 0 

TTAGTTAAAA ATTATGGTTT ATATAAAGGT ACGACTAAAT ACGGTAAGaT CACTATCAAT 48 0 

TTGAAAGATG GAGAAAAGCA AGAAATTGAT TTAGGTGATA AAT 52 3 
(2) INFORMATION FOR SEQ ID NO: 6 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1871 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622: 

AGAAGTGTGA nAAAAATTTA AnAGAGATAT GCACATAGAT GACGCATTGC TATATCCAAG 60 

CAATTGAGAA AGCTGCTGAT GCTCCAAATC aCGGAATGAG GGAACCATGG AGAGTTGTGC 120 

ATGTTCCGAA AGACAGATTA GGAGATATGA GTAAGGATAT TTCTAAATTT GCATTTCCTA 180 

ATGAATTAGA TAAGCAACAA TGTCATTATG ATGCAGTTAC GAAACTAGGT GGCATGTTAT 24 0 

TGCTTATTTT AAAAACAGAT CCAAGACAAC GTCAAAATGA TGAAAACTAC TTTGCATTTG 3 00 

tGCATATG CA CAAAATCTTA TGTTGTTACT TTATGAAGCG GGAATAGGTA CATGTTGGAA 360 

aTCGCCATTA TATATCTATG ATCCTAAAGT AAGnAAACAC TTGGTATAAA GnAAGATGAA 4 20 

GTTCTTGCTG GATTCTTATA TTTAACGGAT TTAGAAGnAG ATATGCCTAA AGCACCACGT 4 80 

AAAAATAGAA ACTTAATTAC ATTATATTAA TATGTATAAT TATAGAAACA TTAATAAAAG 54 0 

CTGAGTCATG AATTGATGGA CAT CTATCGA GTTAGAGATT TAATCTAACT TACTAGAGTC 6 00 

GGTACAATAA CAGTCTCAGC TTTTTATTGT G CAGT ATATA CACATTTTTA TTTTAGTATT 66 0 

TATTtAAAAG TTtCTGCTaA AAATGATTCA ACTTGTTCAG GTGACTTAGC ATTTGCTGAA 720 

TGAAGGTGTG CAATTTTATC GCCGTTTTTA AAT ACT AG CA AGCTAGGGAT ACCCATAACT 780 

TCATTTTCAA CAACTACATC TTCTAATTCA TCACGATTAA CAGTATACCA TTGGTAATCA 84 0 
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CCTGCCTCAA ATTTAACAAT TACAGGTGTA TCGCTATTAA TTACAGATTT AAATGATTCA 96 0 

TTACTTTTGA TTGATTGCAT TGTAACAACT CCTCTAGATA GTTTAATAAT TTTT AT TATA 102 0 

GCTAAATTTA TATCATAATA AAAAATTTTA GCTTCAAAAT GAAAGCCTTT TGCTTTGGAA 108 0 

AATGATATAT TTATTTTAAA TACATAAAGG AGGTTGCAGT CGTATGATTA AATTTTACCA 114 0 

ATATAAGAAT TGTACAACTT GTAAAAAGGC AGCAAAGTTT TTAGATGAAT aTGGCGTAAG 120 0 

TTATGAACCA ATTGATATCG TTCAACATAC ACCTACAATA AATGAATTTA AAACAATAAT 12 6 0 

TGCAAATACA GGCGTAGAAA TTAATAAATT GTTTAATACA CACGGCGCGA AATATCGTGA 132 0 

GCTTGATTTG AAAAATAAAT TACAAACTTT ATCAGATGAT GAAAAGTTAG AGTTGTTATC 1380 

ATCTGATGGT ATGTTAGTAA AGCGTCCTCT AGCAGTAATG GGCGATAAGA TAACATTAGG 144 0 

ATTTAAAGAA GATCAATATA AAGAGACTTG GTTAGCGTAA GTGaAATGTA AGCGTTTACT 15 00 

AAATATCTCG ATATTTAGAT TCATTACATG TAAAATGAAA TAAGCTATAC AATTGTTAAT 15 6 0 

TTTTATAAAT ATAGTTGAAT AGCATCTAGC CTTATGGCAT CATTAATGAT GTAAAGATTA 1620 

ATTAGGAGGG GATTCTCTTG GCAGTACCAA ATGAAyTGAA ATATTCAAAA GAGCATGAAT 16 80 

GGGTTAAAGT TGAAGGTAAT GTAGCAATAA TTGGAATCAC AGAATACGCA CAAAGCGAGT 174 0 

TAGGTGATAT TGTTTTCGTT GAATTACCAG AAACAGATGA TGAAATTAAT GAAGGGGATA 1800 

CGTTTGGTAG CGTAGAATCA GTTAAAACTG TATCAGAATT ATATGCACCA ATCTCTGGTA 18 60 

AAGTAGTTnA A 1871 
(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 
TTGCTGTAGA AGATAAAGAG TCCCATCAAT GGATAGGCTT TATAGGTTTG AATTATATTC 
CAGAAACAAG CGATTATCCA TTTAAAGAAT TACCGCTTTA TGAAATAGGT TGGCGCTTGT 
TGCCAGAATT TTGGGGAAAA GGATTAGCAA CTGAAGGCGC AAAGGCAACA TTGAAGTTAG 
CAGAAGAACA TCAAATATAC GATGTCTATA GTTTTACAGC AGAAGCAAAT AAAGCTTCAC 
AACGTGTAAT GGAAAAAATT GGCATGACAG TGTATGATCA TTTCGAATTA CCCAATCTAA 
GTAAGTATCA TTTATTaAAA AGGcAAGTGC GCTATTACAT TAATCTTCcG AAAGTGGAAA 



120 
180 
240 
300 
360 
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ATTAgGGGGT GtTTnGTkTn ATTTTTTTAA n 
(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

)5 CGATAGTTAT CTAAAGCCAT TTTnGATTGT GTTATGAAAT CTAATGATGC GTGATAATTT 60 

AATGCrACAT AACGrTaATA TAAAATATCA ATAGTGAACA TTTGAGCAAA TAATGAAGTT 120 

GTTGCTCCCA TATGCATCTC ATTTTCATCA GTTTTCCCAT AAGTTAAAAC AATATTTGAT 180 

20 GCCTGTGCTA CGGGATTATC CCTTGTACTA GTAATTGTAA TTATAGGTAT ATGGTAGTCA 24 0 

TCAATAACTT TAACCATTGA TTGCATTTCA CTTTGCGTAC CATTGTTAGT AATAAGAATA 300 

aCACTGTCGT TCGAATTGTG AGTTGCTAAT AATGTAGCAA AAATATGTGT TTCTTGAACA 3 60 

AGTTGAATAT TAAGACCTAT TCTTGATAAC TTTTGGTATA AGTCGGTAGC AACTACAAAA 4 20 

GATGCGCCAA AACCATATAT AAAAATCGTC TCAGAACGTT TTAAACAATG ACATATTTGA 4 80 

TCAATAGTTT TATCATTTAA TTCGTTATTT GCATGATTAA GTGCGCGTGT AGTACGTGTA 54 0 

30 

TGGAGTTTAG TTCTTAAAGA TTCTGTGCTT TCGTTATTCA TTAATTCAAC ATTGTAAATT 600 

GATGATGCTT TAGGAACATA TTTAGATATA TTTATTTTCA AGTCGTGAAA ACCGCCATCA 66 0 

35 GTAAT 6 65 
(2) INFORMATION FOR SEQ ID NO: 625: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 

GATGCCAATT AAC CGTGCAT ATAATGTTGA GAAGTTAATC GAAGCAATTC AATATTATCA 60 

AGAAAAAACA AATCGTCGTG TTACTTTTGA AT ATGGT CTG TTTGGTGGTG TGAATGACCA 120 

ACTAGAACAT GCAAGAGAAT TAGCACATTT AwTAAAAGGC TTAAACTGCC ATGTTAACTT 180 

AATTCCtGTC AACCATGTTC CAGAAAGAAA TTATGTGAAA ACGG CTAAAA ATGATATCTT 24 0 
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TTCGGATATT GACGCAGCTT GTGGTCAATT AAGAGCAAAG GAACGACAAG TAGAAACGAG 3 60 

GTAAAGACAA ATGCTAGAGG CACAATTTTT TACTGATACT GGACAACATA GAGATAAGAA 42 0 

TGAAGATGCG GGTGGTATTT TTTATAATCA AACTAATCAA CAAC TTTT AG TTCTGTGTGA 4 80 

TGGTATGGGT GGCCATAAAG CAGGAGAAGT TGCAAGTAAA TTTGTTACAG ATGAGTTGAA 540 

ATCyCGTTTT GAAGCGGAAA ATCTTATAGA ACAACATCAA GCTGAAAATT GGTTGCGTAA 60 0 

TAATATAAAA GATATAAATT TTCAGTTATA TCACTATGCA CAAGAAAATG CAGAATATAA 660 

AGGTATGGGT ACAACATGTG TTTGTGCACT TGTTTTTGAA AAAT CAGTTG TGATAGCAAA 720 

TGTCGGTGAT TCTAGAGCCT ATGTTATTAA TAGTCGACAA ATTGAACAAA TTACTAGTGA 7 80 

TCACTCATTT GTTAATCATC TTGTTTTAAC GGGTCAAATT ACGCCGGAAG AAG CATTTAC 84 0 

ACATCCACAA CGTAATATTA TTACGAAGGT GATGGGCACA GATAAACGTG TGAGTCCAGA 900 

TTTGTTTATT AAGCGATTAA ATTTTTATGA TTATTTATTA TTAAATTCAG ATGGATTAAC 960 

TGATTATGTT AAAGACAATG AAATTAAGCG TTTGTTAGTA AAAGAAGGTA CAATAGAAGA 1020 

TCATGGTGAT CAATTAATGC AATTGGCATT AGATAACCAT TCGAAAGATA ACGTTACTTT 1080 

CATACTCGCG GCTATTGAAG GTGATAAAGT ATGATAGGTA AAATAATAAA TGAACGATAT 114 0 

AAAATTGTAG ATAAGCTTGG CGGCGGTGGC ATGAGTACCG TTTATCTTGC TGAAGATACG 1200 

ATACTTAACA TTAAAGTTGC AATTAAGGCG ATTTTTATAC CACCTAGAGA AAAAGAAGAA 126 0 

ACATTAAAAC GTTTTGAACG AGAAGTACAT AACTCATCAC AGCTATCACA TCAAAATATA 13 20 

GTAAGTATGA TCGATGTTGA TGAAGAAGAT GACTGTTACT ACTTAGTAAT GGAATATATC 13 80 

GAAGGTCCGA CTTTGTCTGA GTATATTGAA AGTCATGGGC CATTAAGTGT TGACACAGCG 14 4 0 

ATTAATTTTA CGAATCAAAT ATTGGATGGC ATTAAACATG CGCATGATAT GCGTATTGTA 1500 

CATAGAGATA TTAAGCCACA AAATATATTA ATTGACAGCA ATAAAACGTT GAAAATATTT 1560 

GATTTTGGAA TTGCTAAAGC TTTAAGTGAG ACGTCTTTAA CTCAGACTAA TCATGTGTTA 1620 

GGTACTGTGC AGTACTTTTC GCCAGAACAA GCAAAAGGTG AGGCAACGGA TGAATGTACA 1680 

GATATTTATT CTATAGGTAT kGTGTTATAT GAAaTGCTTG TTGGTGAACC ACCCTTTAAT 174 0 

GGAGAAACTG CAGTTAGCAT TGCGATTAAA CATATTCAGG ATTCTGTGCC AAAT GTGAC A 1800 

ACAGATGTAC GTAAGGATAT TCCGCAATCT TTAAGTAATG TCATTTTACG CGCTACAGAA 1860 

AAAGACnAAG CGAATCGTTA CAAAACAATT CAAGAAATGA AAGATGATTT GAGTAGTGTT 1920 

TTACATGAAA ATCGAGCGAA TGAAGATGTC TATGAACTCG ATAAAATGAA AACGATAGCG 1980 

GTACCTTTGA AAAAAGAAGA TCTAGCAAAG CATATTAGTG AACATAAGTC GAATCAACCT 204 0 
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AGCCAGAAGG TACGGTGTAC GAACCAAAAC CTAAAAAGAA ATCAACACGA AAGATT GTGC 2160 

T CTTATCACT AATCTTTTCG TTGTTAATGA TTGCACTTGT TTCTTTTGTG GCAATGGCAA 22 2 0 

TGTTTGGTAA TAAATACGAA GAGaCACCTG ATGTAATCGG GAAATCTGTA AAAGAAGCAG 2280 

AGCAAATATT CAATAAAAAC AACCTGAAAT TGGGTAAAAT TTCTAGAAGT TATAGTGATA 234 0 

AATATCCTGA AAATGAAATT ATTAAGACAA CTCCTAATAC TGGTGAACGT GTTGAACGTG 24 00 

GTGACAGTGT TGATGTTGTT ATATCAAAGG GCCCTGAAAA GGTTAAAATG CCAAATGTCA 24 6 0 

TTGGTTTACC TAAGGAGGAA GCCTTGCAGA AATTAAAATC GTTAGGTCTT AAAGATGTTA 2 52 0 

CGATTGAAAA AGTATATAAT AATCAAGCG 2 54 9 
(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626: 

TGCTTACTTC GCCTTCAATA CGTACTAATT CATGTCCACA ACTTGGACAA TGGGTTGGCA 60 

TATGATATGT GACAGCATCC TCAGGTCTAC GTTCTGGAAT ACTACGTACA ACTTCAGGTA 120 

TGATGTCACC TGCTTTTTTC ACTACAACAC TATCACCAAT TCGAATATCT CTGTCATGAA 180 

TTAAATCCTC ATTGTGCAAA GATGCTCTTG aTACAGTTGT ACCAGCTACT TTTACTGGTT 24 0 

CTAAAATAGC AGTAGGTGTG ACTACACCTG TTCGTCCAAT ACTTAATTCA ATATCTAATA 3 00 

ATTTAGTTAC TACTTCCTCA GCTGGAAATT TATAAGCAAT GGC CCATCTA GGAGATTTTT 360 

GTGTGAATCC CATCTCATCC TGTTGATCTA AATCATTAAC CTTAATAACA ATCCCATCAA 4 20 

TATCATAAGG TAATGACTCT CTTTGGCTTG TCCATTTTTC AATATACTCT AAAACACCAT 480 

CGATATTATT TACACGCGCT CTATTTTTAT TCGTTGTAAA ACCTAATTTA TCTAACTCAT 540 

CTAATGCTTC ACTTTGCGAA CGCGCATTGA AATCAGTGAA ATCATTGACA CTATATATAA 6 00 

ATACGCTTAG CTTTCGTTTT GC CGTTAATT TAGAATCTAA CTGTCTTAAT GATCCCG CAG 6 60 

CAGCGTTTCT TGGATTnGCA AATAACTGCT CATCATTTTT TTCTTTTTCT TCATTTAATC 720 

GTAAAAATGA ACGTCTCGGC ATATATGCTT CACCACGAAC TTCTACATTT AATGGtTCTT 780 

TCATTTTCAA AGGTATCGCA TGAATTGTTT TTaAATTTTC GGTAATATCT TCACCTGTTG 84 0 

TTCCATCACC ACGTGTTAAA CCTTGaACGA AGTATCCATC AACATATTTC AATGATACTG 900 
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GTTGGTCGAA TTTTCTCAAA TCATCCTCAT TAAATGCATT CCCTAAACTT AACATTGGCG 1020 

TGTCATGGTT GACTTTATTG AAAGAGGCTT GGGCTTCACC GCCAACTCTA ACTGTTGGAG 10 80 

AATCTACAGT CTTATACTCA GGATGCTCCT CTTCTATTTT AATCAGTTCA TGAAGTAATT 114 0 

TGTCATATTC ACTATCTGGT ACAGATGGAT TATCCTCTAC ATAGTATTCA TAACTGTATT 1200 

GATTTAATAA ATCATGTAAC TCGTTCACAC GAGACGATAA ATCAGCCATC CCTTAATCCT 1260 

CCTTTTTTtC AATTGGTGCA AATTGCGCTA ACAAACGTTT TGGCCCTTGT GATTTAAAGA 1320 

TAATATCTAG TTCGATTGAG C CATTTTTCT CGTTTACATT ACTCACCATG CCTTCTCCCC 1380 

AGGCTTTATG CATCACTTTG TCACCTACAT TCCAATCAGA TGACAATACT TGTTTTTTCG 144 0 

TTGACGTTGT TCGTTGACTA AATCCGCGTT TAGCAAAAGG TTTTGCCTTA GGTTGTATCG 1500 

TTTGTCGTTT GCCACTTGAA TGATTTTCTA ATAGTGATTC TGGAATTTCC TTTAAAAATC 1560 

TGGATGGCAT ATTTGACTGA GGGCGACCAA ATAACATTCT TGATGTCGCA TGAGTGATAT 1620 

ATAACACCTC TTCAGCCCTT GTAATTGCTA CATAACAAAT ACGACGTTCT TCTTGCATTT 16 80 

CATGATCATC TTCACTCTTA ATCGCTCTAA TATGTGGGAA TAAAGATTCT TCCATCCCCA 174 0 

TTATAAAGAC AATTGGAAAT TCAAGGCCCT TAGCCGAGTG CATCGTCATT AGTGTTACGC 18 00 

CATTTTCAGT ATCTGCCTCA TCAATATCAG CTACTAACGA TAAATCCGTT AAAAAGTTAA 1860 

TTAATGACTG TTCTTCTAAT GGGGTATTTT CCTCATAGTC TTTTGGTACT GACATAAATT 1920 

CATCGATGTT TTCTAATCTA CTTCGAGATT CTAATGTATT TTCACGTTCA AGCATTTCTC 1980 

GATAGCCAGA CTTTTGTAAT ACTTCATCAA CTATTTCATG AATTTCTAAA AATTCTTGTT 2040 

CTTTTATCAA GCTTTGGATT AACTCGTAAA AATTAAGACA CTCTTGTGTC ACCTTTTTTG 2100 

aCAATCCGAT AAAATCAGCT TCTCCAAGTG CATCAAACAT ACTGATATTG TTTTGAAGTG 2160 

CATAGTTTTG AACTTTTTCA ACAGATGAAG GACCTACACC TCTTTTTGGA ACATTaATAA 2220 

TACGTTGCAA ACTAATGTCA TCATTACTAT TGGCAATTAT ACGCAAATAA CTTAATAAAT 2280 

CTTTGA 2286 
(2) INFORMATION FOR SEQ ID NO: 62 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 
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TGACATTAAA AGTTGGGTGA AnAAACAAGG CATACTTTAG TTAAGCTTGA TGAAAATAAC 120 

AATGGAATTA ATGCGATTAT TCAAAAAGAA AAAGCAAAAG ATTTAGATAT AAATT ATTC T 180 

GCTAAAGGTA CTACCAATTG TATTATTTAG TGGAGAATTA GACAAGCTGT AGCAG CGTTG 24 0 

ATTATTGCCA ATGGTGCTAG AGCTGCTGGA AAAGATGTAA CTACCTCCTT ACTTTTTGGG 3 00 

GGCTTATGCC nTTAAAAAAG TGCCACCGTT AATGTTAAAA GCAGTTGCCA AAATGTTTGA 3 60 

TTnATGTTGC CCCAAAAGAT TTACGATGCC CCTTCCCAAA 400 
(2) INFORMATION FOR SEQ ID NO: 62 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628: 

AATAATTTGG GCGCTnTTTG CGTCGGGATA TTATACCGCT TCCTTAATTG TTCAACATTG 60 

TAATCACTGT TTTTCAATTG ATATTTTGCA GAGTAAATTG GTACTTCTGG GTTATATGAC 12 0 

ACTTCGTCCT CTTTATAGTT TTCCAATTCT TTGAAATTCC CGTATTGTAC AAAGAAGTTA 18 0 

AATTCTTCGA TTTCTTTTTT TACTTTTTCG TCATCGATTG GTTTTAATGG AATCATTTTA 24 0 

TTAkTTTCCA TTTTCACAGG ATATCTTTTT GTATGATTGT GTGTCATTCC ATCGCTATCT 3 00 

TCAACAACTT CTCTAACAAT ATAATGC CCT TTAGCCGTTC TAGTATTTCT GTTAATTTCT 36 0 

AAAACTGCTC CTCTrGATTC CAGATTTTCT CCTTTTAATT GGATTTT CAT TTCAGATCTA 420 

ATTAGCCAAG TACCTTTATC ATCTtTTTTA AAT 453 
(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 

TCATATGCAT TTGCAAAATA AACGCCAGAA GCAAGGTTTA GAATTGGGCC GTCCGTTTTG 60 

CTCAaTTCAC TTG c ATTCAA TAATTCATGC T GAT CATGAT CAATTTGCTT ATCTAATTCT 120 

GCaATTTTCT TCATTTGCTT ATCTGATTTG TTTTCTTTCG CCATCATTTG ATCACGACGA 180 
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GCTAATGGTA CTAATACTTT ATCTCCATAT GTGTCAATAA AGTTATAAAA ATAATCATCT 3 00 

GTTTTTGATA CAAATCCAGC ACGCTCTTCA GTTTCACGAT ATAAATCTAA GAAAAGATTG 3 60 

AACTCATCAC GTTCAAGGAA TCTGACTTTA ACACCATAGT TTATCGCTTT ATTAATATTA 4 20 

CGTTTACGTT GACTATCAAA TGTCTTTTTC AATGTTTCGG GTGTTTTACC TTCAAGGTTT 48 0 

AATACGCCCA TCCATCGTAC TTGGCTCGAT GTATCATACT CAGTTGTAAA GCCATGATGC 54 0 

TCGTAACCAT GTGATTTAAA CAAGTTTACT AGGGCATCAT TTTTCTCG CG ACCTTCAAAT 600 

GGCACGATAT CTTTATCATA TAGATGATAT AACCAATACG GATCTAATTT AACATATAAA 660 

CATTGATGTT GCTGTAAATA TTTATCTAAC TCTTTTAAAT AATAATCAAC TAATCCTAAA 720 

TCTGAAAAAT CCATTACTGG ACCACGATTC GAATAGTAAA CATAACTTCC CATAGTAGGA 780 

ATTTTAGAGA AAAGGCTTGC TGCAATTACT TTGTTATTGT CGTCTTTAAT ACCTAATAAA 84 0 

ACTACTTCAA AGCCATCATT CTCACGGGTA ACTATATTTT CTTTTACTTG GAAATAATGA 90 0 

CTTTCCAATG ATGGATTTTG TACAAAGTTG TCAAATTCGG TAACAGTTAA CTCTGTAAAT 960 

TTCATGTTTT GATAATTCCt TCCTAAAAAA TTCTGTCTTT AACTTTTTTA AGTGCGGTAT 1020 

ATGCTGCGTA AACAGGTTTA TTAATTGGTT TAATAAAGTC ACCAACATAT TCmATAATTT 108 0 

CAG C ATTGTA ACCTTTTTTG AATTTAACTA CACCAGCATC TTCAGCATCT TCTGTAAATT 114 0 

TrCCACTAAC ACCATAGAAA TTATAACGGT CAATG CATG A TTTAATGCAT AATTAATCAT 1200 

TTCCCATTGC ACTGCATAAC T 1221 
(2) INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 

TGGCCCAAnT AACACAAGTA ATTGCTGCTA AT GGT AA CAT CACAAAAAAT GAAATCGTAA 60 

CTACAAATGT TAAACCTTGG AATACACCAA CCATTTCTGG TAATCGTTTA CTATAGTATC 120 

TATTGTGAAT CCAAGTAATA ATAGCTGAAA TAATAATACC ACCTAGAATA TTCGTATCCA 18 0 

ATGTGGCAAT ACCTGCAATT GATTTTAAAC CAGGTACATT TTCAACGCCT TTTTCTAAAT 24 0 

TAGCGCCAAA CGTATGTGGC CATTGTGTTA AAATGGCATT TATAAATGTA TTAAACATTA 300 

AGTAACCCAT CAATGCTGCA AGTGCTGCAT GACCTGGTGC TTTTTTAGCT AAAGAAAGTG 36 0 
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TTACTGACCA AAATTTAAAC CAAAACGTAT GTTGATCTGC TAAACTCCCC ATGATTGTAG 4 80 

GATTTTTAAA TAATGTCGCA AAGCCAAGCA CTATCCCAAA GAAAGCGAAC ATTAATACCG 54 0 

GTACAATCAT TGCACTACCG AAACGCTTTA TCGCATTCAT CTTCTATTCC CTCCATATCA 600 

TCTTTCCTAA CAATACATCT AATTAGATTC ATTTATAAAT AGATGTCTTA CTATTTAAAT 660 

ATAATATATA GTAAACGCTT ACACACCTAC AACGACATTG ACGTATTTTG AAAGTATTTT 720 

GTATAATCAG ATTATCTTTT CATATAGTGA AAATTTTTTC ACGACCTTAT ATATGACATC 780 

GTTGTATTTG TAATACATTC GTTTTAAACG CATAATCAAA CCTATATCAA TACACAAATA 84 0 

TATATAATGA CATACAAGAT TTTAATGTAA TAACGATCTA TTACACATTT ATTTTCAAGG 90 0 

AGGTTGAATA TGTTTTTAGA TGAACACATT AATCGAAACT TTGATAAACT TAATGATAAT 960 

GATTTACATA TCGCTCACTT TATCAATACA CATATAGATG AATGTAAAAA TATGAAAATA 1020 

CAAGATTTAG CGCAATTCAC ACATGCCTCG AATGCAACCA TTCACAGATT TACACGCAAA 10 80 

TnAGGTTTTG ACGGTnATAG TGGATTTAAA TCGTACCTTA A 1121 

(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4005 base pairs 
CB) TYPE: nucleic acid 
!C) STRANDEDNESS: double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

AACCTTCCAT TTTACTTGAT CGATAACATC AGTTCTGCTT TACAAATCTC TTCATTAATA 60 

TGCGTCTTAA AACCATGAAA TTTAACATGT TCCGATAAAT GATAATCTTC TACAAGTTGT 120 

CGATATTCTG ACAAACCATT TCCATGTCCA TAAATATTCA ATTGAATATT GGGATGTTTT 180 

GTTACTAATT GCTTGATTAC TTCAATTTGA TGTTTAATTT GTTTATTTTC AACGAGGCGA 24 0 

GCAATTGATA TGATATGATT TTTCTCCTTT TGATTGATGT CAAATTGATA CTTTAAATTT 300 

GCCACGTAGC CAACCGGAAT ATTGATAACT GGTATTTTAT TTTCAATATA TTGTGAAATA 360 

TCTTGGCATT GCTTTTCTGT TGATACAACA ATCGCTTTAT AAC GTGTTAA ATTATTAAAC 420 

ACTGTTTTAT AAAAACTTTT TATACCATTA CCGGCACCGG ATAAATGTGT ACTATGGAGC 4 80 

ACAACAATAA CTGGAATACT TTGATTTAAT CCCGCTATAA CATTTCCTAA TTCATGAGGA 54 0 

CGATCTAATA TGATTTGATC ATTATTTTTA CATAATTGAT GGAGAAAATA TTGAACTAAT 6 00 

TCATCTTCTG TATCAAAAAA TTGTTGATGC TGGTCTTCAT TTAAGATAAC CTTTGTGAGC 6 60 
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TAGTAATTTT CGAGTACAAT CCGTTGTCCT TCACCTAAAA TTCGAGAACA ACTTAAAAAG 780 

CCTCTTCCAT CATACAATTC GCGTTTTACT TTTCTTCTTT TATGATCAAA ATAATTCACA 840 

TAATTTAATT GATGATACTG TTTAT CT AAA AAATGA G CAT ACATTACAAA TTGCTCTTCA 900 

TCATATATTC TGACAT CATT TGAATTTTCC ACAAATTTCA ATGTGTACCT ACATGACTTT 96 0 

TCCCAATACT GTATCCAGTT AACTTGCTTT GTCTTTTTAT AATTGATTGC TTTTTGAAAA 102 0 

TAGTCATACA TTGTAAATAC ATCATTTTCA ATCTGATGTT GCTTCGCATA TGTGTATGAA 1080 

TAAGGATTCC ATTTAACATA TACACATTTT GAAGATATGC CGTGTTGTTT GAACAACTTC 114 0 

AATCTATTTA TTTGCG CTTT TTCTACACCT GTAATTTTAC TTTCTAAAAT TGTTCCTAAA 120 0 

ATGTAATTCA TATTATCGCC TCATATAAGT TTTATTCCGT AT CTTTATTG TTTATTTTAT 1260 

ATGAAAAATA CATCTATTGC ATGTGTAATT ATAAAAAAAC CAGGCCACAA GGACCTGGGT 1320 

CATATTGTAT TATTTGTTTT GTTTTTTGCG ACGACCGAAT AACAATAATG AACCTAATGC 13 80 

TGCAAATAAT CCACCAAATA ACGTTGCGTT ATTTGAGCCG TTATTTTCAC TACCTGTTTC 144 0 

TGGTAATGCT TTTGCTTTAT TGTGATGGTC TTTAGTAGTA CTCATTGGTT TAACAGGTGT 1500 

ATGTTTTCCT GCATCCGAGT CTGAATCGCT GTCTGAATCA CTGTCTGAGT CTGAGTCGCT 156 0 

ATCAGAGTCT GAGTCGCTGT CCGAATCTGA GTCGCTATCT GAGTCTGAGT CGCTGTCTGA 1620 

ATCTGAATCA CTGTCTGAGT CTGAGTCGCT ATCTGAGTCT GAATCGCTGT CTGAATCTGA 16 80 

GTCGCTATCT GAGTCTGAAT CGCTGTCTGA ATCTGAGTCG CTATCTGAGT CCGAATCGCT 1740 

ATCTGAATCT GAGTCGCTGT CTGAGTCTGA GTCGCTATCT GAGTCTGAAT CGCTGTCTGA 1800 

GTCCGAATCG CTATCTGAAT CTGAGTCGCT GTCTGAGTCT GAATCGCTAT CTGAATCTGA 1860 

GTCGCTATCT GAGTCTGAAT CGCTGTCCGA ATCTGAGTCG CTATCTGAAT CTGAGTCGCT 1920 

GTCTGAATCT GAATCACTGT CTGAGTCTGA GTCGCTGTCT GAGTCTGAGT CGCTGTCTGA 1980 

GTCACTATCT GAGTCTGAAT CGCTGTCTGA TGTATCTTCT TCGAAGTATC CGTTATCAAG 2040 

TGTG AAATGA TCATGATCCG TAATTGTTAC GTCAACTTCG CCACCATCTG CATCTTTATC 2100 

ATCTTCAGTT GTATTTGTAA CTGTTTGTGT TAAGC CAGCA GGCTTTTCAA AAATAACTTT 2160 

GTATTTACCG CTATCTAAAT TATCAAAGCA GTATTTACCA TTTTCATCTG TTTTAGTTGT 222 0 

TCCAATTACT TCGCCTTTTT CATTTAATAA AGTAACTTTA ACATCTTTGA TACCTTTTT C 22 8 0 

AGTTGAATCT TGTTTGCCGT CTTTATTACT GTCGTACCAA ACATAATCAC CTAAACTATA 2340 

TTTTGGTGTT TTATAGAAAC CACTGTCTAA TGTCATGTTA TCTG CATCTT TAATGACACC 24 00 

TGTTGTTGTT AAACCATTAG AATCTTTTTC AGTATCATTT CCAGAAGTTA CTGAAGTTGG 24 60 
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TTGATATTTA CCATTTTCAT CTGTTGTAAC TGTTTTTAAA ACTTTGTCGT TTTCATCTTT 2580 

TAACGTAACT GTTACACCTG AAATGCCCTT TTCATCTTTA TCTTGAACAC CGTTTTTATT 2640 

5 TGTATCTTCC CATACATAGT CACCTAAGTT GTAAGTCGGT TTGTAGAAAC CAGAGTCAAT 2700 

AGTATCGTTA TCTTTATCTT TAATGACACC TGTTGTTGAT GTAC CATTTG AATCTATACC 2760 

TTCATCAGTT CCTGAACCTA CTTGTGTTGG TGTGTAACCT GATGGTGTTT CGAATTCAAC 2820 

w 

TTTATAAGTT CCATTTTCTA ATCCAGTAAA TTGATATTTA CCATCTTTAT CTGTTTTAGT 2880 

TGTTTGTAAA ACTTCACCGT TTTCATTTTT CAATGTAACT GTTACGCCTG AAATACCTTT 2 94 0 

7£ TTCAGTTGAA TCCTGCTTAC CATCTTTATT TGTATCTTCC CATACATAAT TACCTAAATT 3000 

ATATTTTGGT GTTTTGTAGA ATCCACTATC TAATGTCATG TTATCAGCAC CATTAATAAC 3 060 

ACCTGTTGTT GTTAAACCAT TAGAGTCTTT TTCAATGTCG CTACCAGATG TTACTGTAGT 3120 

20 CGGTGTATAG CCTTCTGGTG TAGTAAATTC AACTTTATAA TTACCATTAT CTAAATCAGT 3180 

AAATTTATAT TTGCCATCAG CGTCTGTTGT AACTGTTTTT AaCAgTTACC GTTTTCATCT 324 0 

TTTAATGTTA CCGTTACGCC AGATATACCT TTTTCATCTT GGTCTTGGAT ACCATTTTTA 3 3 00 

25 TTTGTATCTT CCCAGACATA GTCACCTAAG TTGTATTTAG GTTTGTAAAT ACCTAAGTCT 33 60 

GCAGATAAGT TATCTTTGCC ATTAACTGTA ATAACTGAAG ATAAGCCGTT TGAATCTAAT 3 4 20 

TCTT CGTTAT TACCTTGTTT TGAAGGGGTT ACTTCATAAC CTTTTGGTAA GTT TGAAAAT 34 80 

30 

TCTACACGGT AATCTCCATT AGGTAAGTTT GGAATCAAGT ATGACCCATC TTCTTTAGTA 3 54 0 

ACTGCTTCTC CTACTTTTGT ATTTGTATTA TTATCAAATA CAGTTACAGT TACATTGC CA 3 600 

ss ACGCCTTTTT CTCCTAATTC TTGAACACCG TTTTTATTAG TATCTTCCCA TACGTAGTTA 3 6 60 

CCAATTTTAT ATACTTCTTG ACCAGctCCG CCACTTTGGT TATTAGTAAA TCCTAAAGCA 3720 

TTGCCAGTAG AAACGGATTT ATTACCTGTT GAAGATAAAG TAGCCATTTG AACAAGTGTT 37 80 

40 GGGCTTTCGC TATTTGTATA TTGGAATTTT GTATTAACCA TTACAACATA AGCAGAATCT 384 0 

GCATTTCCAA AATCAATAAC AGCGCTATTG TtGTCGCCAT ATGTAATTTT CTGCAAGTAT 3 900 

TGATTTGTTA CATCTGTAAG CT CTTTAGTA TTCACATCGT ATC CTTTATT TAATGTATAA 3 96 0 

45 CCTTTAGGAA CTTGATATAT TTTTATATCT GTTACATCTT TATTT 4 00 5 

(2) INFORMATION FOR SEQ ID NO: 63 2: 

(i) SEQUENCE CHARACTERISTICS: 
so (A) LENGTH: 1440 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

55 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 

TATTAGGTTA CTCTAGTTTC CAAAGCGGGA ATTTTAATGT TATTAACAGC AAGGACAGCA 6 0 

AAAGCAATAT CGGCGCATTG ATTGAAAATC CAGGAATATA TCCTTTTATG TCTGGATATG 12 0 

AAAACTTGAA GTTATTGAAT GAATCAAAAA ACACTCAAGA TATCGATAAA ATTGTCTCAC 180 

AACTTCATAT GGATGAATAC ATTCATAAAA AAGCTAAAAC GTATTCTCTT GGTATGAAAC 24 0 

AAAAATTAGG AATTG CTATA GCATTTTTAA ATAAACCTCA ATTCATTATC TTAGATGAAC 300 

CAATGAATGG CTTAGATCCA AAAGCTGTGC GAGATGTACG TGAATTGATT GTCCAAAAAG 360 

CGCAAGAAGG TGTTACTTTC TTAATTTCGA GTCATATTTT AAGTGAATTA GTTAAAATCA 42 0 

CAAACTCTAT CCTTATTATT AACAAAGGTA AAATTGTTAC AGAAACATCG GAAGAAGAAC 480 

TTAAACAATT TAAAGATAAT GATTTAGAAA ATGTATTACT AGAAATCATA GAAAGGGAGG 54 0 

ACCAAGCATA AAATGGGAAC TTTAATTAAA CAAGAATGTT TCAAATTATT TAAAAAGAAA 600 

TCAACTTTTA TCGCACCTAT TGTCTTTATT CTACTAATGG TTGCTCAAGG TTATATTGCT 660 

ACAAAATACA ATGAAATTTT TACGCCACAG GAATCTTTCA CATCTGCTTA TAATGGTTTT 72 0 

TCATGGTTTG CATTTTTATT AATTATT CAA GCAAGTACAA TCATTTCAAT GGAATTTCAT 780 

TACGGTACGA TTAAAAATTT ACTCTAT CGT GAATATTCAA GAACAACTAT GATTGTTAGC 84 0 

AAAATCATCA CATTATTTAT TATTTCTTTA ATTTATTTTG TTATTACAAT TATTGCTTCA 900 

ATTGTTATTG GGTCTTTATT CTTTAATGAT TTAAATATAT TTGAAAGTAG CGGTAATCAA 960 

TTATCTTTAT TGAATCAATT ATT ATT AG TT AGTTTAGGCA CATTTGTTGG CGTTTGGTTA 10 2 0 

GTTTTAAGCT TAACGTTGCT ATTATCATCT GCAACAAATT CAACGGGAGT AGCCATTGCT 108 0 

GTAGGTATTG TTTTTTATTT TGCAAGTTCT ATTTTAG CAG TTATTCAAAC GGCACTTTTA 114 0 

GAAAAAATAG ACTGGCTAAA GTGGAATCCT ATTAATATGA TGAATATTAT GCTTCAAACA 1200 

GTTGAAAAAG GCTTTAGTAA GTCGACAAAA TTAGAACTTC ATGAATTGTT TATTGGTAAT 1260 

ATTGCTTATA TTTCTATTTT CTTAATACTT GTAGTATTTA TTTTCAAGAA GAAAAATATT 1320 

TAGTAACTTA AAGTATTAAA TGTCTAAATA CACACATATT CCATCGTAAT TCAAAATCAT 13 80 

TTTCAAATCC CTTCACCCAA ATAATGGTGC GGGGATTTTT TCATCCAAAT TTTGGAAATT 144 0 
(2) INFORMATION FOR SEQ ID NO: 633: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 

GCTGACATAA TTGCATCAAA TTTCACATCC CCATAAAAAT CGCTACCACA TAACCTACGA 60 

5 TAATACCTAC AAGAACTGGA ATTAAAGATA GGAATCCTTT AAAAAATCCT TGAACGACTA 12 0 

TTGTTACAAG CAAGGTTATC ATTGCAACAA TTAAGAAACT GATATTG TAA CCTTTCATAT 180 

CTCCAGGATT TTCATACATT GCCATATTGA CTGCAGTAGG CGCTAAGCTT AAACCAATTA 24 0 

10 

CCATGATGAC TGGTCCAACA ACAACTGGTG GTAATAATTT CATTAACCAT GCTGTCCCAC 3 00 

TTAATTTGAT TAGAATCCCG ATGATGACGT ACATAACACC ACTCATGaAT AATGCTACAA 36 0 

r5 GCATGTCTCC TAAGCTATGC GTACTTAATC CCGTGATAAT TGGCGTGATA AATGCAAAGC 42 0 

TAGATC CCAA GTATGCTGGT ATTTGCGCCT TCGTTATTAA GATATAAAGT AATGTACCGA 4 80 

TTCCCGAAGC TAGTAACGCT GCTGATATTG GTAGTCCTGT TAAGAATGGT ACTAGTACTG 54 0 

20 TTGCGCCAAA CATCGCAAAT AAATGTTGTA AGCTTAAAAA TGCCCATTGC GCTGGTTGTG 600 

GTTTTTCATT TACATCTAGT ACGGGTTTTA CTGTTCGTTC AAACATTTCA TCATTTTGCA 660 

TAATATT CAT TTCCTCCGAT AATAAAAAAA TCTCTTTACA TCAGTATATG TAAAGAGACA 72 0 

25 AAAAGTGTGA CAAGTTGCTA CAAGTCATTT TCGTCCATAG AAATTGACTT ATAGTTGTCG 780 

AACATGAGGG TATTATTAGA TAAACAAGCA TATGAAAACT TATTTATCAT TCAACTCCCC 840 

CACCTTTTTC AGTCTCTCGT ACTGAATTAA AAGGGGtATT ATTTAATTAT AACTGCATTT 900 

30 

CTTTGATCcA TTtCTTCyAA ATAGACACTT ACCGTTTCCT CTTTAGAAGT AGGTAwATTT 96 0 

TTACCAACAA AATCTGCTCG AATTGGTAAC TCACGATGTC CTCGATCAAC CAAAGCAGCT 102 0 

AAACCAATTT TAATAGGTCT AGCATTTAGC AAAATAG CAT CAAGTGAAGC ACGAACCGTT 108 0 

36 

CGACCAGTAT ACAG CACATC GTCAATAATG ATGACTACTT TATCTGTAAT ATCTGTGTCG 1140 

ATGTCTATTG CGTCTTTTGT CGTAAGTGAT GACATGTGcT CTATATCATC TCTAAAGTAT 120 0 

40 GTAATATCAA TTGTTCCAGT AGGTATACGT TGTTGCTCAA TTTGATGAAT TTtATCTTGT 12 60 

ATACG ATT CG CTAAATATTC ACCTCTTGTn TTGATACCTA AAAGATTAAA TTATCAGTAC 132 0 

CTT 132 3 

45 (2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

55 
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ACCCATCTCA TCGTATTTTG AATTTAATAG ACGAGATCGA TGTATATCTG AATTCATCCC 60 

AACTATGGAT TAATGTTGGT ACATCATtAA cGCATAACCA ACATTTTGAG CAGTTGTTTT 120 

ATAAGTAACG TGATTTTTAT CTAATTGCCC TCTTAATGCG TCCTCTGTAA ATTCAACACT 180 

ATCAGAACCA TTAGAG GTCG CTTCATATAA GTTATTAGAT GCAATATGTG CTAAATCGCT 240 

ATTGATTTTC AATGGTTTTA ATCCTTTTAA TTTTCTCATT TCATTCGTTA CTTCATAAAG 300 

AGAAATTAAT TGATTTGGAT TTTGCTCAAC TGGACGCTTA TTATGCTCTT CTGACGTAGA 3 60 

ATTAGAATTT AATTGATAAG GTTCAATATC TGCTAACATT TCTTTTGTTA AAAATCGTAC 4 20 

ACTTAGCACC TTTTTCGATT GTTGATCAGA ATA CACTTGT GCATATATGT CGCCATATTT 4 80 

AATCaGTGTT TGTGTTTTTA AATCTTCATC TGAAAGTTCA AATTCATATT TTTTAC CATC 54 0 

AACTTTAAAG GACGGTTCTG GATTAATACT TGTATGATTA AAAATTTCTG CAGAATGTTG 6 00 

TCCTATTTTT AACGGACTAA CATTGACTTT CTCACCTGTA GCATACACTG AAACGATTTC 660 

TTCACGTTTA GTTGAAACAA TGTAATAACT GTTTTTGTCT TTAAACACAT AATTTTTGTA 72 0 

rCCATCTCTA AAAGGG TAG A CrCGATCTGC TTGTC CAAAT T 761 
(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

nTAAATATAT TTATATATTA TAGAATAGAA AGACCTGAAG ATTGAATATC TTTCGCAAAG 60 

CCTTTAACTG TATCTACTGA TAATTCGTTA ATATCGCGAC CTAAGTTTGT ATTCACTTTT 120 

TTCACAACAT CTGCTGGGCA TGTAATAATA TCTGCACCAA TTTCATCAGC TTGAATCACA 180 

TTGAATAATT CGCGGCAACT TGCCCATAAT AATTTAACGC CGTCTTTACT ATGCGTAACT 24 0 

TTnACAGCCT CTkTCATTAA TGGTaATGGA TCTACGCCTG TAtCTGCAAT ACGTCCTGCA 300 

AATACTGAAA CATATGTTGG CACACCTTCA GTTACTGCTT CAGTTATTTC TTTAACTTGT 360 

TCAATTGTGT AAACAGCCGT AACGTTTAAT CTCACATTGT CAGCTGAAAG TTTTTTAATT 4 20 

AAAGGAATCG TTGATTCACC TTTTGTATTT ACAATAGGAA TTTTAACAAA TACATTTTCG 4 80 

CCATATTGTT TTAAAATTGC TGCTTCTTTk TCCATAGTTT CTAAATCGTC TGCAAATACT 54 0 

TCAAATGArA TTGAAG CATC TGGAATTTCT TTCACAGCTT CTTCAGCAAA AGCTTTGTAA 600 
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TTTTTATAAG CTGCTTTCAT TnCTTCAATA TCTGCACCGk CCGCAAATAC TTCTACATTT 72 0 

AGTTTAGCCA TATAAyATAG CCTCCTTGAT TCTTATTAAA ATTTTAACAA CATCTGCATG 780 

kCTTTTTCTT ACAACCATTT GTAAAAAATG ATTTTTATTT CTTtGTT 82 7 
(2) INFORMATION FOR SEQ ID NO: 63 6: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 

TTAG TCGGTA TAACCATCGG CTAGGTGGTT TTTGTATTAA AAAAGTGGAT aCCAAAATTT 60 

ATTAATAATT ATTTTAATGT TAGAAAAAAA CTAAATAAAA ACTCGCTAAT GATATCCAAT 120 

AATATGTATA CAAAACGAGA CATATATTGC ATATGATTAA CGAGATACTG AAAATATTTT 180 

ATCACCCCTA AAATGATTAT TCATTTTCAG CGGTAATTCG ACCTAAAGTC AAACTTACAA 24 0 

TAAAAC CGAT GATAAATACT ACTAATGAAA CGAACCACAT CACGATATTA GTTGGTAAAC 3 00 

CTGGAAATAC TGCAAAGAGG GAGCCAACAA CAAAACCAAT GATTAATGCA AAAGTCATTA 3 60 

GTTTATGATG TGTTAGGAAA TACTGGATAA TTTTGCTTGA AATAATGAAT CCAGCAAGCA 4 20 

CGCCAAATCC GACTGCAAGT AATATAGGAA GACCTGCAAA GTTAAGTTTA ACAACTTCAG 4 80 

ATATTGCTAG CATGACCGTA CCATAGACGC CAAATACTAA TAACATAAAT GACC CTGAAA 54 0 

TAC CTGGGAG TAACATAGCA CTAGATGCAC ACATACCTGC AATAAAATAT TTAATAATAA 600 

GACTAGTTGA TAGAGTAAGT GTTTCTCCAG CATGTTTATC ACCATTATTC ATTAATG TAA 66 0 

TAACAATTAA GATAGCGATA CCAGCTATAA CCATCATGTA ATGTyTAG TT GTAAATGACG 72 0 

TTTTATAGTT AGAAATTTTC AATAAATATG GAACGATACC AATGATTAAT CCACCAAAGA 78 0 

AAAACATAGT TGGAATATGG TGTTGGCTTA ATAAATAATT AAAAAGATTA CTTAGTGATC 84 0 

CCATTGCCAG TAACATTCCA ATTATAATGG GGATTAAAAA TGTAAAACTT GGCCAAAAAC 90 0 

GTCGTGAGAA TATGCCGCTA ATTGAAGCGA TAAATTGATT GTAAATACCT AACAATAATG 96 0 

CGATAGTCCC ACCGCTAACA CCAGGTACCA AGTCACTCGT TCCCATAGCA AAACCTTTTA 1020 

GAATATTAAT CCATTTAAAT TGTTG CATG A ATAACTCCTT TCAAACGATT GGAATAAAAT 10 8 0 

CATAAATAGC ATCATACCAT ATTACAAATG TCCTAGTGAA ATGATAACAT ATTTTAAATT 114 0 

CATAAAATCC ATTGAGAAAT TATGTGCACT TATTATCATT TATATTTTTA AAGAGAGCGG 12 0 0 
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AGGTATAAGT AAGTTATAAT TAACTGAACG CATTATTACA AAGTCTTTTT GACTACAAAT 1320 

TAAAATTATT ATAAACTAGT TAAGAAAACT TTATATTTTA CGGAGGGAAT ATAAAATGGC 13 8 0 

ATCAACATTA GAAATyAAAG AC CTACATG T GTCTATTGAG GATAAAGAAA T CTTAAAAGG 144 0 

TGTTAACTTG ACAATTAACA CTGATGAAAT ACATGCGA 14 7 8 



(2) INFORMATION FOR SEQ ID NO: G37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1995 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637: 

ATTACAGCAT CTTCTCTAGG TAGATTATTA AAAGATAGAG GTCTAAATGT AACAATTCAA 60 

AAATTCGATC CATACTTAAA TGTTGACCCA GGTACAATGA GTCCTTATCA ACATGGTGAA 12 0 

GTATTCGTAA nGGATGATGG TGCAGAAACT GACCTAGACT TAGGACATTA CGAAAGATTT 180 

ATTGATATTA ATTTAAACAA GTTTTCAAAT GTGACAGCCG GTAAAGTGTA TTCACACGTA 24 0 

TTGAAAAAAG AACGTCGTGG TGATTACTTA GGCGGAACAG TTCAAGTTAT TCCGCATATT 300 

ACAAATGAAA TTAAAGAACG TTTATTACTT GCAGGGGAAA GTACGAATGC AGACGTTGTT 360 

ATCACTGAAA TTGGCGGTAC AACAGGTGAT ATTGAGTCAT TACCGTTTAT TGAAG CGATT 42 0 

CGTCAAATTC GTAGCGATTT AGGTAGAGAA AATGTTATGT ATGTTCACTG TA CATTACTG 480 

CCTTATATTA AAGCTGCTGG AGAAATGAAA ACGAAGCCAA CACAACATAG TGTTAAAGAA 54 0 

TTACGAGGCT TAGGTATTCA ACCAGACTTA ATCGTTGTAA GAACTGAATA TGAAATGACA 6 00 

CAAGATTTAA AAGATAAAAT TGCATTATTC TGTGACATTA ATAAAGAAAG TGTTATTGAA 6 60 

TGTCGTGATG CAGACTCTTT ATACGAAATT CCATTACAAT TAAGCCAACA AAATATGGAT 720 

GATATCGTTA TTAAACGTTT ACAATTAAAC GCGAAATATG AAACACAGCT TGATGAATGG 780 

AAACAGTTGT TAGATATCGT TAATAATTTA GATGGTAAAA TTACAATTGG TTTAGTAGGT 84 0 

AAATATGTTA GCTTACAAGA TGCATATTTA TCAGTTGTTG AATCATTGAA ACATGCTGGA 900 

TATCCTTTTG CCAAAGATAT TGACATTAGA TGGATTGATT CAAGTGAAGT AACAGATGAA 960 

AATGCAGCCG AATACCTTGC AGATGTCGAC GGTATTTTAG TACCAGGTGG ATTTGGTTTC 1020 

CGTGCAAGTG AAGGTAAAAT TAGTG CAATT AAGTATGCTA GAGAAAACAA TGTACCATTC 1080 

TTTGGTATTT GTTTAGGAAT GCAACTTGCA ACAGTTGAAT TTTCAAGAAA CGTATTAGGC 114 0 
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TTACCAGAAC AAAAAGATAT CGAAGATTTA GGTGGTACAT TACGCTTAGG CTTATATTCA 12 60 

TGTTCAATTA AAGAAGGCAC ATTGGCACAA GATGTTTATG GTAAAG CGG A AATTGAAGAA 132 0 

AGACATCGTC ATCGTTATGA ATTTAATAAT GACTATAGAG AACAATTAGA AGCAAATGGT 1380 

ATGGTGATTT CTGGTACAAG tCCAGATGGA CGTTTAGTAG AAATGGTAGA GATTCCGACA 1440 

AATGtTTCTT TATTGCTTGT CAATTCCACC CAGAATTCTT ATCTAGACCA AATCGTC CGC 1500 

ACCCGATTTT TAAAT CATTT ATTGAAGCTT CATTAAAATA TCAACAAAAT AAATAAATTT 1560 

GCTAATAAAA CCGGTACTTT CATTGTTAAA CATTGAAAGT ACCGGTTTnT CGTATAATTT 1620 

TAATATTATG TTAGTGACAA GGTATGAAAT AACAATAGTG ACTTTTATAA TTCTAAGTCT 1680 

CTTGTCATTT CAATCATTTG TGTATAAATG TCATAGTATA CATAATTCAA TGCCATCGCA 1740 

TGTGGTyGGA CAATCTTATC GTAATCTTCA GTGTAGACTA TAGGTcTTGG TGTAGATAAA 1800 

TCGATAAAAT GTACGAGATG ATCAGGGAAA TCATCTGTTT TAGGTTTGTT GCTTATTAAG 1860 

ACCACATCGA TATCTAAGTC GATAAGTTTT TGAATATCTA ATGCAACTTG ATyATTATAA 1920 

AATGGTGCGA ATAATAATAC ACGATCAGTT GAGTCAATTT CTTTAAwk TC TTTAATAGCG 1980 

TaAGTTTnCG GCTAG 1995 
(2) INFORMATION FOR SEQ ID NO: 638: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 38: 

ATT AG TGATG AAAGTCAGAT TGAAGCTTTA TTAACAG CTG AAAAATATTC AGAAATGATT 60 

GGTGAATAAT CACCGTGTAA CTCCTTAATC TAAGATTGAG GAGTTTATTT TTAGTCTGAG 12 0 

AAAATAAATG ATATGAAAGA AAAATTATTA GGTACTATTA TTTGGAGTAT TGCTACATTT 180 

TATTATTCAA GAATGATGGA AATAATGAAT TTAGCTATTT TAAAAATAAA AATTGGGGGA 240 

AGTTAATATG CTAAACATTC AAGACGTTAn CATnCTTTCT AAAAAGGAGC AAAAAGCATA 300 

TAACCGTTTC GTAGAATCTG TAGAAAACGG TAATTTACCA GTACTACCAT GTATTGAAAT 360 

GGATCTAAAA GAGATGCAAG AAGAAACATT AAACCAGAGT AAGATTGGTG GAATGCCATT 4 20 

TTTAAAATCT TTTAAAGATA TACCATTAGA TGAAAATAAT GTACCAATGG TATTGCTAGC 480 

ACAGATTAAT TTGGATGATC TTCCAGAACA ACAAGAATTA TTTCCTGTAA AAGAAGGGAT 54 0 
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AAACAATATA AACTCAAGGC TTGTTTATAT AAAAGAGCCA ATTACAGATT TATCACTCGA 6 60 

AAATATTCAA GCGCATTTGA AGTCATTAGA TGCTGATAAT GAGGATATCC CGTTCAGTGG 72 0 

5 AGCATTTTCT ATAGAATTTA GATTGTCGAA ACAAACTATT ACATGTACTG ATTATAAGTA 780 

CGATGAGGAC GTGCTTGCAT TGTGGAATAA AG TCAATCCA TCCTTCGCGC TAAAATCAAT 84 0 

GTTTGGTGGT TATGATGAAT TGATGGAACC TGTGTGTAmC AywTTTACTG CTAAGgAACC 9 00 

ATTTAATCAA CTTGGTGGTT ATCCATATTT TGACCAAATA GAT C CAAGAA CGAACGATcA 960 

AGAACTGAAA ATGTATGATA GAGTCTTACT GCAAATTGaT TCTACAAGAG ATGGTAATTC 1020 

JS TTCGATTATa TGGGGTGaTT TAGGTATTG c CAATATCtTA GTGaAATCTA CTG r ACCTTG 108 0 

aGGcTAtGaa GTTTTGAATG ATTACCT 1107 
(2) INFORMATION FOR SEQ ID NO: 639: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 

ATTCATATTA TTATAAATTA TTTCTACACC ATCCCAATTG AGTTGTTTTT CATAATTTAA 60 

ATGTAATTCC ACTAACTCCC TACCAATTTC AACAAATCCA TATACATCCT TTAATATCGG 12 0 

TATTCGCGGA AAACCTTTAC TCAAATCACT TGAATATTTG TTCACATAAT ATTTATGATG 18 0 

3S CAAAATTGCA TATATATAAT ATACTATCTC TTCTGAATTA AGATTTATTT TCTTTTTAAA 24 0 

AGAATTAGgA AATATTATCT ACArGCCTCA AACTATCTTT ACCTTTGtAT GTAGCAAAGC 3 00 

CTTkGCCATT ACCAATAAAt TGGAAATTAG GTAATATGTC CGTGATCATA GCCGAGAATT 36 0 

40 CTTTATTCAT TCCCTGTCCT TGTATATAAA TCACCTGTCC AGTATTCTCC ATTATATTAT 420 

AATATCTACT TGGCATTTCC ATAATATTTT TGTCGTACAC TATCCATTTT TTTGTAAATG 480 

GTCTATGCAT AAATTTAACA ATTCTCTCTG GATTAATTGA AATATTTTTT CCTTTAGAAA 54 0 

ATTTTTGGGT AAGTCCTCGT GTCCAACTaA TaAATGTTTC ATCTTTGTTC ACTAAATTTA 6 00 

TACGTTCTCT TGAATCTAAG ATATCAATTA AT CTATCTAT TTCAGAATTA TAGTTATCTA 660 

so CAAGTAATTT TGcATTTACT AATGCTTTTT CATTCGAAAA ATTTGTTACC C AATTAT CTC 72 0 

TTGCTGaATT TACTCCATTA AATkGAkCTA AATATATAGA ATTTTCAATA TCCTTGGAAT 780 

CATACATTGG TAAATAATTC CCCATAGTTT ATGTCTCGGT GATTAATCCA ATCATTGGGG 84 0 
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AAAG 

(2) INFORMATION FOR SEQ ID NO: 64 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 0: 

CGATGTCTTT ACTATTAGAC TTAGCCATTG GTTTCACCTC TCCAAAAATT GTAAATGTGT 60 

ATCATCAATA TGAAAGTTAC ATAAAACTGA CATATTTCTT TAAAATATCA ACGCCATTGA 120 

TAACTTCCTG TTTTAATTGA TACGCTGTAA CAAAATACTA TAGTTAGTGC TTACATGTAT 180 

ATGTTAAAGC AAGCAGTGGT AAATGTAAAT TATAATTATT CATTAACTTT GCAATATATT 24 0 

AAATCTTTTA TTCATAGAAG ATAAATATCA AATCAATCAT AATTATTTGA CAACAAATAG 300 

CTAACGATTG TTTTAATCTA CATTTGGCTT ATAGCATTTT AAACCTATAC TCTATTTTGA 36 0 

TACAATATAA GTGTAAAATC AATCATAAAA AGGATATTCA ATATCTGCAT CCAAGAAAAA 4 20 

CATTACAATT ACCTTT 436 
(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

GTTATTAAAT TCAGAGTGGT AGCAAATTAA AGTTAnTCAA GAGTTAAGAT GAATTTAATT 60 

CATGAACACG TCTATTATTT TTATAATTGT AGCAAATAAA GCTTTACATC AAGGAGGTAA 120 

TTAAATATGT TCAAAAAATA TGACTCAAAA AATTCAATCG TATTAAAATC TATTCTATCG 180 

CTAGG TAT CA TCTATGGGGG AACATTTGGA ATATATCCAA AAGCAGACGC GTCAACACAA 24 0 

AATTCCTCAA GTGTACAAGA TAAACAATTA CAAAAAGTTG AAGAAGTACC AAATAATTCA 300 

GAAAAAGCTT TGGTTAAAAA ACTTTACGAT AGATACAGCA AGGATACAAT AAATGGAAAA 36 0 

TCTAATAAAT CTAGG AATTG GGTTTATTCA GAGAGACCTT TAAATGAAAA CCAAGTTCGT 42 0 

ATACATTTAG AAGGAACATA CA 44 2 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

CCAATTTTGG TATGAATTAT ACAGATAATT CnGCGCCCGG AGGATCATTT GCTTATTTAA SO 

ATCAATTCGG TGTGGATAAA TGGATGAATG AAGGGTATAT GGCATAAGGA GAACATTTTA 120 

ACTACTGCCA ATAACG GAAG ATATATTTAT CAAGCTGGAA CTTCATTAGC CACACCTAAA 18 0 

GTTTCGGGAG CACTAGCTTT AATCATTGAT AAATATCATC TTGAAAAACA TCCAGATAAA 24 0 

GCGATTGAAT TGTTATATCA GCATGGGACA TCTAAGAATA ATAAACCATT TAGTAGATAT 3 00 

GGGCATGGTG AGCTTGATGT GTATAAAGCA TT AAATG TAG CAAATCAAAA AGCAAGTTAA 360 

TAAATCAAAG GAGTTTTTGA TTATGGCAAA ATTAGTTACT GAAAACATTT CGAAGCGGTT 420 

TAAAAATCAA GATGTATTAA AGCATATTAA TATCACTTTA GAAAATAACG AAGTTTATGG 4 80 

ATTACTTGGT ATTAATGGAG CCGGTAAAAC GA CACTTATG AAAATTATAT GTGGCATACT 54 0 

TCAACAAGAT TCAGGGGAAA TTAAATTAGA TAATAGACCA ATGACACGAA ATGATTTGCA 6 00 

CAAAGTTGGT TCGCTTATTG AAACACCTGC GACATATAAT CATTTAAGTG CACAAGATAA 6 60 

TTTGAAAATT GTGTGTTTAA ATGAAAGCGT TGATTTCaGC GAAATTAATA GTGTTTTAAG 72 0 

CTTAGTCAAT TTAAATGTCG ATAAAAAGAA AAAGGTTAAG GACTTTTCTT TAGGTATGAA 7 80 

ACAAAGACTT GGAATTGCAA TGGCaTTAAT TAAAAaGCCA GAAATTTTAG TATTAGACGA 84 0 

ACCATCTAAT GGTTTAGACC CATATGGAAT CCAAGAACTT AGAGAACTTC TAAAATTATT 900 

AACAGAACAA GGTACTAGTA TTATTATTTC AAGTCACATT TTATCTGAAA TCCAAGTTTT 960 

AGCAGATCAT ATCGGTATTA TTCATGAGGG TGAGCTAAAA TATCAG CAAA GAAATAACAA 10 20 

AGATGAAAAC TTAGAAGAGA TATTCTTCAA AATAACGAAA GGTGATTACA AATGATACAT 1080 

TTAAAGATaG AAGG TATCAA ATTTAAAAAt TCTTTCAGTA TGTATGTTTT ATTAATAAGT 114 0 

CcGcTGGTAT TTCTTTGTTT TGCTATTTTC ACAGTCTTAT TCGCCAAAAG TAATACGGGA 120 0 

ACAG CGAATA GTGTGTCACC ATATATAACT TTACTATTTA ATATTTGGCC AATTGCTTTC 12 60 

ATCCCGATrG TATTATGTAT GGCTTGTAAT TCGTTATTTA AAATTGAAAT GAGAAATAAA 1320 

TCATTTAATT ATTACTTAAG TAATAATTGG TCGATTACAA AAGAAATAAG AGCAAAGATT 13 80 

TT CATTTTAT CAATAGCATT TTTGGTACAT TGCTTTTTAG TATTTATTAT TGCTTATATA 14 4 0 
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TTGATGTATG TAGTATCTCT ACCATTGATA CCGCTCAACT TTTTATTAAC TCGATACTTT 1560 

GGTGTGTTCG TATCAATATT AATAAACTTA GTATTATCAG TCATTTGTGT CTTGTTTTTA 1620 

ACATTGAAGA GTTTATTTTG GGTGTTGCCG TGGGGGATAA TGCAGAGAAT CCCGCTTATT 1680 

ACGCTTGGTA TACTACCTAA TGGtTTAGTT GTAAACCATA ATTCAAAATA CTTTAATGAT 174 0 

CTCAATGCCT TATATATTTC GATTATTGTT AGCATCATTA TTTTCGCGAT AGTAACATTT 180 0 

TTAAATAATA AGAAAAGTTG GCGATTAAAA TGATAATTAA CGAATTAAAG TCATGTAAGT 186 0 

TGAAATTTTC TAAGCAAGCG CTCACATTTG TACCCATTAT TGTAAC CATA TTGTTTATAT 192 0 

TATTTATAAA TTGGTATTTA AACGTAAATT TATGGAATGG TCGACAAATm AGTTTGTTTA 198 0 

CAGCGAGTTT T AATG CAATT ACATCGCTAT TAATTTCTAT AAACGTCTAT CAAGTTATCA 204 0 

ATTTTGAAGA AAATATTGGT CACTTTAATC ATATTTTAGG AAAAG CTAAT AGGCTAAATT 210 0 

GGTTAAATGC ATCAATGATT TTTACTTATA CTATTACAGC CATATGTATT CTATTAGCAT 2160 

CAATT AATTT ATTGTGGCAT TCACATGATA TGAAAATAAC ACTTATGTTT ATAGGcGTaT 2220 

CATTGTTTTT CAATGTaATT ATATTACTGC TACTTTTTAT TTTTAGTATT TTCATTAAAG 2280 

ATGTAATGGC TATTGTTGTC GGAGTTTTAA TGTTTATTTT TAACGTTTAT TTTGGATTAG 2340 

AAGTGCTTGG AGATCATTCG TGGTTCTATT TACCAATCAC ATATG CTACA CGTTACGTTT 24 00 

ATATGTTTAn CGAAGGGGAG TATAC CAGTT ACATTAACAT TGGGCAATCT ATATnATTAT 24 6 0 

CACGnTGCCG AT 24 72 

(2) INFORMATION FOR SEQ ID NO: 643: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 646 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 
ATAATATAGG AATTATTTCG ATnACAGATT TTACGAATCG TGCTACGATG CAAAATGAAn 60 
ATAAAGATCC ATATGGCGAA AAGTTAGCTT ATGGAATTGC TTTTAATGG C AGTGTGGATA 12 0 

TGCAAGGGGA TAAACAAGTC ACAATTCCAA AATATAGTGT AGTTACAATT ACTGGCGAAA 180 
ATAGTAAAAA TTATCGTGTT ACCGCCGATA ATAAGACTTA CTATGTTAGT AAAGATAAAT 240 
TAGAATATTT TAACCCGGCA GGTTTATATC AAACGCATAG TTTTAAAAAA TTAGCACCAT 300 
ATATGAAATC AAATTATAGT AATTACTATG CATACTTTAA TAGTCAATTA CATAAAAAGC 360 
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CACAACAACC GATACAATTA CTTTTCAATG ATAATAATCA GTTATACGGT TTTGTTTATC 4 80 

CAATTGTAGA TAAAAAAGAA TTAAAAGATA AG TTTAATAT TAACAATAAC ATTTGGATTA 54 0 

CTAAAGTTGG GAATGGATAT TGTATTGCCA ATTTGAAAGA AGACAAATGG ATTTATATTG 600 

AATTGTAGGT GTAAAGATGC TAGATAATAT TATTTTATAT TTTAAA 64 6 



(2) INFORMATION FOR SEQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

TAATAT CGGA ATTTGATAAT GAAGATATCT AATTTTTTAA TATTCGTAGC TTTTATTTTT 60 

CTACTTATTT TATGTTTATT TTTAATCTTA CAAATGACAA ACCATTAAAA GTAGCATCCC 120 

AACATCAAAC AAAAAAACAA TTCATCAAAT AAAAATCGCT ACAAAACCAA GTCATTAAAC 180 

ACGCAATAAT TAAAATTTTC CACTCATTAT AATTCTGAAT TCCAAATGTC GAATTCCGAA 24 0 

AACCAAACTC CAAATTCCAA AAACGCAACT CCAAAATTAA AAGCATTTCC CTACCATTCG 3 00 

GGAAATGCTT TTTACATACT GgATTACTCT GTCATTAATG ATTTTACAAC GGgAAACCAT 36 0 

GTCGtCATGT ATGACCaAAG TAGCGTCGCT AtCaTaAgGt GGTTCGGATC TTTATTGGAT 4 20 

AATnAT 4 26 



(2) INFORMATION FOR SEQ ID NO: 645: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 

nTTATTTACC CACAACATGT TGCGACATTA GGTAAATGGG TACCTTATTT ACTTGGTATT 60 

GTTATGTTAG GTATGGGATT AACAATTACA C CTAATGATT TCAAAATGGT CTTTAAAGCA 120 

CCTAGAGCAG TAATTATTGG TGTCTGTCTA CAATTCAGTA TTATGCCCAC ATT AG CATTT 180 

AT AATTG CAA AGTCTTTTCA TTTACCACCT GATATTGCTG TTGGCGTAAT ATTAGTTGGA 24 0 

TGTTGTCCGG GTGGGACATC AAGTAATGTA ATGAGTTATT TAGCCAAAGC TAACGTAGCA 300 
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ATATATCTAT TTG CAAATG A ATGGTTGGAA GTATCTTTCG TGAGTATGTT GTGGTCAGTT 4 20 

GTTCAAGTTG TATTAATTCC AATTGCTTTA GGTATTGTTT TGCAAATTAT TAATCGTAAA 4 80 

5 ATTGCTGAAA AAGCTTCTAC AGCTTTGCCA ATTATATCAG TTGTTGCTAT TTCATTAATT 54 0 

TTAGCAATAG TTGTAGGTGG CAGTAAGCAC CAAATCTTAA CTACAGGATT ATTAATATTT 6 00 

TTAGTAGTTA TTTTACATAA CGTATTAGGG TATACGATTG GATATTGGTT AGCTCGTCTT S6 0 

TTAAAATTAG ATCGACAAGA TCAAAAAGCA GTCAGTATTG AAGTTGGAAT GCAGAACTCT 72 0 

GGTTTAGCTG TGTCATTAGC aGCATTGCAT TTTAATCCAA TTGCAGCAGT ACCAGGCGCA 7 80 

j5 GTGTTTAGTT TCATTCATAA TATAACAGGG CCTATTTTAG CAAAGTATTG GTCAAAAAAG 84 0 

TTATAATTGC ACTAATAGAA TGAAGTGGTC AT CGGACTAT GTTAAGCTTT GATAAAGAGA 90 0 

AAAAATAGAG GAGTAAATAT ATGTATAGAG CAGTTATATT TGATTTCGAT GGAACAATAA 9S0 

20 TAGATACGGA ACAACATTTA TTTAATGTTA TTAATAAACA TTTAGAGATG CATAATGCCG 102 0 

ATCCTATAAG CATTGATTTT TATCGTTCTT CTATTGGAGG AG C AG CTACA GATTTGCATG 108 0 

ACCATTTAAT TAAAG CGATT GGTTCGGAAA ATAAAGATAA ACTTTATGAA GAACATCATC 114 0 

25 TTACTAGTAC AACATTACCG ATGATTGATA CGATTAnATC ATTGATGGCA TTTTTAAAGC 12 00 

AACGTCACAT TCCTATGGCA ATTGCCACAA GTAGTGTGAA AGCGGAAATA ATGCCCACCT 12 6 0 

TTAAAGCATT AGGTCTAGAC GATTATATAG AGGTAGTTGT TGGTAGAGAm GATGTTGAAC 13 20 

30 

AAGTTAAACC TGACCCTGAA TTATATTTAT CTGCAGTACA ACAATTAAAT TATATGCCGA 13 8 0 

CACAATGTTT GGCTATTGAA GATTCTGTAA ATGGTGCAAC AGCCGCGATT GCAGCTGGAT 144 0 

^ TAGATGTTAT TGTTAATACG AATAAAATGA CAAGCGCACA GGACTTTTCT AATGTAGATT 15 00 

ATGTAGCAAA AGATATTGAT TACGATCAAA TTGTAGCGCG TTTCTTTACG AAATAGGAGG 15 60 

CGTATCATGA TGGGTTACAT TATATTGTTT TTTCTAGCTG GTCCAGTAAT TTTAGGCGTT 162 0 

40 GGAAATTTGG TGATTGGTCC TATATTTAAC AAACAGACAC CATTTCGCGT GCAAGTAAGA 168 0 

TCTTTTGTTG kTGGkTCmAT GrTTTACTTA ATACTCGCAA CAATTGG CTA TTTTTTACTA 174 0 

TTACAAGGTA AACTTTAACG AGAAAACCAC CTTACCTCAT TAAATGGACG ACCATATGTA 180 0 

45 TGTGAAATGG TAGAACGTTC ATGTTTATGT ATGAGATAGG GTGGTTTAAA TAGTTACATA 18 6 0 

TATTTTAATA ATAACGTCAC GATGATAAGT ACAATTAAGA TAATATCTAT GCCTACCATA 192 0 

ATTGTAGCTC TTGTTGCAIT ACTTCCTTGT TCTTTTGCTG ATTTCATAGC ACGGTAGTTT 198 0 

so 

GGCACAAAGC TAATAATTAG TAAGATTAAT ACAATTACAC CAATTAATGC TGTTGTCATG 204 0 

ATGAACGACC TCCTTTATTT TTTTCAATCA ATTCCCAAAT AAACGTAGCA ATCACAC CG A 210 0 
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CAATAATTAA TGCAAT CGGT AAAGTCGTAC CGAGTTTAAT CTTGCGCTCT GGAGAATTAA 2 22 0 

TAATAGTAAA TACTG TAAG A CAAATGAGTA TGAAAGCAAG TGTTG CAATA ATAGTTCTTC 22 80 

CAACTAAATA TAGGATGTCA GGTTTTTCCA TACCGATATA ATTTATGATG AAAAATGCTA 2 340 

CAGCAAAGAG TACCGATATT TTTGTAGCAC GTAGCAGTAT TTGTTTTAAC ATTGATATAC 24 00 

TCCTTTTTAA TATTATTAAA ATTATATCAT AATTACCAAG AATAG CTGAA GTTGTATGTG 24 60 

ACTCAACGGT ACTTGAG CAA CTTTTTTAAT TTTTTAGAAA AATCACAAAA TAATTGTTTG 2 520 

CAAAGTTGCA AAAGCCTGCT ATAG TAGTTC TGTAAACGAT TGCATGGTAT G CAAATATT A 2 58 0 

ATGTACCAAA ATCGATAATT TATAGTATAA TTACGGCAAT AAGTTTTTTT ATGGATTTAT 264 0 

TTAGTATCAA TCAGAGATGG GGTAAGAAGT TATGGAGAAC AATGAACTAC AAAGGGGATT 2700 

GAGTGCCCGT CAAATTCAAA TGAT TGCACT TGGTGGTACG ATTGGCGTGG GGCTTTTCAT 276 0 

GGGTGCGACA AGTACAATTA AATGGACAGG CCCATCAGTT ATCCTTGCAT ATTTAATTGC 28 2 0 

GGGTATCTTT TTATTTTTAA TCATGAGAGC AATGGGGGAA ATGATTTATT TAAACCCTAC 28 8 0 

AACAGGATCA TTTGCAACAT TTGCAAGTGA TTATATACAT CCTGCAGCAG GTTATATGAC 2 94 0 

AGCATGGAGT AATATATTCC AATGGATTGT AGTTGGTATG AGTGAGGTCA TCGCAGTAGG 3 000 

AGAATATATG AAGTTTTGGT TCCCGGAATT GCCAACTTGG ATTCCTGGTG TTATTGCTAT 3060 

TTTATTATTA ATGGCAGCGA ATTTATTCTC GGTAAAAGCG TTTGGAGAAT TTGAATTTTG 312 0 

GTTTGCTTTA ATTAAAGTTG TAACAATTAT TTTAATGATT ATTGCTGGTT TTGGTCTTAT 318 0 

nnTCTnTGGT TTTGGAAATG GTGGCCATGC GGTAGGTATT TCTAATCTAT GGACAAATGG 324 0 

C 3241 
(2) INFORMATION FOR SEQ ID NO: 646: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646: 

AGGCGTCAAC TCAGATGGTT TAATAATTGC CGTATTACCT GCTGCAATAG CAC CGATTAA 60 

AGGTt CGAaC ACTAGTtGAA AAGGATAGTT AAATGG TG CA ATGATCAAAA CTGTTCCATA 120 

AGGTTCTTTT TTGATATAGC TTTTTGTTGG AAATAAATAT AAAGGTGTGT CTACATTkTT 180 

TGTTTTAGTC CAGTTTTTAA GTTCcTTACG GGCAATTTTG ATACTTyTCA AAGTTATGCC 24 0 
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AATATCGCTC TCGTATGATT TAATAGCTTT GCTTAACTTC TTTAATTGCT CTTTTCTAAA 36 0 

ACTAATATCT TTAGTTTGTT GTGTATTGAA AAAAGCTTTA CTGTCATAAA ATTTTTG CTC 42 0 

AATGATATTC ATAATGAAAA GAACCTCCTT ATATGATTAT TTTGGAAAAA GCGATTAATT 48 0 

GATTTGAATG TTGTGGCCGT TAATTTTAAA TGGTCTTTCG AATTATATAT GTTGAAAGTT 54 0 

GAAAATAGAG CGATGAATCG TGTACATAAT AATATTTATA ACTTTAATCA TAACGAAAAA 6 00 

GGTAGGAAGA AAACAAAAAT TTATACTCAA CATCGCAAAT ATTTTAAGAA AATGTAAAGA 66 0 

CAAAAGGGGA ATTGTATAGA AATCACTAAT CTGTGGGTTA GGGTAGCTAA AGGAATAAAA 72 0 

ACTACTATTG AAAAAGGGTT GTAAATTAGT CAAACGTAAA TAAAAAACAG TTCATTGAAA 78 0 

GTGAAATAAA TTCTACTTTA ATGAACTGTT AGTTAAATAC AACATGTCTA TAATTAGACA 84 0 

GTAATATAGT ATTATTTTGT TAATGCTTCA GTGATTTGAG GTACGATTTG TTTTTTTCGA 90 0 

GAAACGACAC CAGATAAGAA GGCCATGTCA TCTTCTAATT GAACATTGAA TGtTCGCCAA 96 0 

CTTTATCTTT TTCAGCACCT ACAACTAAAA TTTTAGAATC ACTATTAATG ATGTCAGTAA 102 0 

CAACAAGTAC AAATAAGTCA TATTTTTCTT GTGCACTTAC AGCTAACATT TCTTTTTCTA 108 0 

rATCTTCTTT ACGATTTAAC ACTTCGTCAA GGTCAACAGC ATTAACTTGT GCAATACGAG 114 0 

TCACATAGTC ACCCATAGTA AATGATTTAG CATCCATGTT TAATAAGAAT TCAACTGATT 12 0 0 

TATCAGTTGT TGAAGCACCT GCTTTtAACA TATCTAAGCC GTACTTTTGA ATATCAACTT 12 60 

TAGCAATATC TTTnAATTCT tCAGCTGCTT TAACATCTTG TTGTGTACAT G 1311 
(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 

CATATACTTT TTCATTTCTT TACGAGATAC TTTACCAGAG GATTTAGACT TCATACGCTG 60 

ATCCATATGT GCTTGCGTTT CAGaATGTCC ACAAACACAA CGATATACCG CTTCTTTCCC 120 

TTTACCAAAC AACGTTAATT TCTTTTTACA GTTTGGACAT CTTGCATTTG TTTTGCGCTG 180 

TACATTCTTT TTCGTCTTAC AAGATGGATC TTGGCACACA AGCATCTGAC CATTTTTAGT 24 0 

TTTAACTTTA ATCATGAATT TACCACACGT TGGGCATTCT GTGGTTGTTA AATTATCGTG 300 

TTTATATTTA CGATCACTAT TTTTAATCCC ATTTACAACA TCTTTCGTAA AATCTTTCAT 360 
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CCATTGTGcA GTTAAAAGTG GCGACGTTAA TTCTTCTGGT GCTAATTCTA ATATTTG TTT 4 80 

ACCTTTTGAC GTTACTTTAA TTTTACCGTC TCTTGATTCA ATGGCATTCA TATTAAATAA 54 0 

TTTATCGATA ATGTCGGCCC TTGTTGCAAC TGTG CCGATA CCACCTGTTT GTTTTAAAGT 6 00 

TTGCGCATAT TTTTTATCCT TCAATTGAAT AAAGTTCTGA GGGTTCTCCA TCGCTTTTAA 660 

TAACGAACCT TCATTAAAAT ATTCTGGAGG TGTTGTTTCA TGTTCTCTAA TATTTGTTTT 7 20 

TGAAATCTTC ACTTCATCGC CTTCTGAAAA AGGCTGTTGC ATCTCTGTAA TAGATTCACC 7 80 

TTGTCTAATA GATTTAAAAC CTAAAACAGT TGTTACATTC TCTTTCAAAA CAAATGTGTG 840 

CCCTGCAACC TCTAAAGTTA CAGTTATCGC GTCATACTCG TGCGGAGGCA TTAAAGCTTC 900 

TAAAAAACGC TCGACAATCA TATCGTATAA CTTTAATTCT CTATTACTTA AGTCTGACAT 96 0 

GACAGGTCTC ACTT CTGTAG GAATAATTGC ATGGTGATCA GATACTTTTT GATTATTAAA 102 0 

TATCGACATT TTTGATGAAA ATGTTTTAGA CATTAATGGG CGTGCTTGGT CTTTATATGT 108 0 

TGTTGCCATC GTCACCTGAA TACGTTCTTT CATAGTATCT ACCATATCAG TTGTTAAATA 114 0 

GTTTGAATCT GTTCTTGGAT AGGTTACGAC TTTATGTCTC TCATATAAGC TTTGAAGTGT 12 00 

ATTCAATGTT TCTTTAGGTC CAATTTTATA ACGTCTATAC ATATCTTGTT GTAAATCTGT 12 6 0 

TAAATTGTAC AGTGATTGCG GATACGACTT CTTATGTTTA GTAGCAACAG ATTTAATCTT 132 0 

ACCATCGACA TTTTTCAAAT T ATT AAC CAT CTGTTCTAAA GTTTCTTTAw TGGtATATCG 13 80 

CTGATTTGaw TCTAGCTGAA AATCAAACCC TTTTACCGTT AATGATAATG TAAAGTATTG 14 4 0 

TTGTGGnTTG AACTGATTAA TCTCTTGTTG GTCGTGTAAT TTACTAAATT GAAACGGn 14 98 
(2) INFORMATION FOR SEQ ID NO : 64 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 64 8: 

TAAAACAATT CGTTTTATTA ATACAGTTTG TAAAAGTATT CTCTTAGTAC TAATAGCTAA 60 

TGTAATGATA GAAAATACTA GTGTTATTAA AGATTTGAAT AAAATAAAAG AAACTGAGAA 120 

ATATTGGAAT GTATTAGATG ATTATTACAC GATTGAATTT GCACCTTATC ACGAAACAAA 180 

ACAAAGTTTG ATTGATAATA TGGTGCGATC AGAACAATTA GTAAAGGCTA GTGAAGCAGA 24 0 

AAATAATGCG ATTTTATTCA AACCAAAGGG TGACTCCGTT GACAATGACA ACTTTTCG CC 3 00 
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TCAACCTGAT ATTCCGATAA AAAATCAAAA AAATAATGTC GAAGTAATTA TTCCACAAAA 4 20 

GTTTCATGCA ATGCGTAATG AAATCAATCA AGCATATCAT TCATGGTTTG AATTTGTACA 480 

AAATAAAAAT AATAAAGAGA ATAAGTTATC TATACAGTTT ATCAACAAAA ATGATTGTCG 54 0 

AATTTTTTCA TTTGATGCAC GAGATAGTCG CCATTTGTCA TTTATAGAGG CGCCAATCAT 6 00 

TGTGAATGTT CAGGCATCAG ATTTATCGaA TGATTTttAT TATGCCATGa TCaGTCaAGG 66 0 

CGGGTATTTa TTCaAAAATT ATGaCGCGCT AGTAAAAAAT ATTGGAAAAG TATCCATCTT 720 

GATGGGGAAA TCCAGTGGAA TAACCAATTA TAAAGATAGC GTGATGGAAA TGTATCATGA 78 0 

AAACAATTTG AAATTAACAG TACTCAACTT TTCACAAATC ATTATCGCAA TCATTTTAAT 84 0 

AATTATTATT TTATTTGATG TGAAATATTA TTTTGAACAG CATCGAAAAT TACT CGTAAT 90 0 

CAAAAAGCTA TATGGTTATT CAACATTAAG AGCCAATTAC CAATACTTAT TAATAAATAA 960 

TATAGTTGTT ATTTTTATTG GAATATTGAC GAATGTAATT TTACATTCTC ACTATATAAT 1020 

GATGTTATTT GCAACGATTC TTGT 1044 
(2) INFORMATION FOR SEQ ID NO: 64 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 

GAACATATTG GGTTATG CAA GGnGGTCACT CTTCACACTT ATAAACAACA TTTTAATAAT 60 

GTAAAGTTTA ACCAGCTAAC ACTTTTGTTA GCTGGTTTTT ATTTTCCTTC AATTTTTAAA 12 0 

TGGTTAAGTC CCCTTCTATA TCTTATAAGA CAATCATTAT AATCAATTCA ATTAATACAT 180 

TAACAACCAC AACTAATAAA TATAGTAACT TCAAAATCCA TATTTATGTC TAAAGATAAT 24 0 

CTCAATGTTG TTCACGTCAA TAAAATTATC CCTAGGTTTT TAAAAATTGT ACATGTTTAA 300 

ACAATCAAAA GTGTACATTA TTAAATTATC ATTTCCAGTT AGATTTAGAA AACATTCACA 3 60 

CCACGCATGG AC CAACGTAT TCGTCTTcAT TcATTTTAG 3 99 
(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 650: 

GTACTAGGTT CTAnGATTCC CTACTGTAGT CATAGATATT TTCCAAACAG AGCCTGACAA 6 0 

5 CTTTTCATTT TTAATAGTGA TCACTATATT AATCGCTATT GCAATTTATG CATGTCGATT 12 0 

TGTTTGGGTT TATTT CTGGT ACAAAGATTT TTATTTCCCG AAAAATATAC AATCTTATCT 180 

AGACGAGGAA CATGATTCAC ATGAAACACC ACCTTCTCGA GTGCGTTACG CATTTATTAT 24 0 

W GACCATGTGT GGTATTCACG GTACAATTTC ACTTTCAATG GCACTTACAT TACCATTTAT 300 

CATTACAAAA GGACAAGCAT TCGAATACCG TAATGATTTA TTGTTTATTG CATCTTTCAT 360 

GGTATTAATT AGTTTAATCT TAGCGCAAAT TGTTTTACCT TTAATTACAC CATCTGCCGA 4 20 

15 

AGATACTACT TTTAAAGGTA TGACTTATCA ATCTGC CAAA ATTTTCATTG TTCAAAAAGT 4 80 

GATCCAGCAT TTTAAAAACG AAAGTAAAAA AGACAAAAAC GATACAAATT ATCGCCCAGT 54 0 

20 ATTAAACCAA TACTATGGAG AATTGTTATT TTTATTAAAT TCAGAACCTG ATAATCAAAA 6 00 

TACrAAAGAa CTCAAACGTT TAGAAGATAT TGCaAAAGTA ATCGAAACAT CTACACTTGA 660 

GCGTTTAATT GATAAAGGTA AGGCAACATA TCAGGATATT AATAATTACC GCAATATTGT 720 

25 CGAATTAACA GAGACACACC GTACTGC 74 7 



(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS : 
30 (A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 
ATACAATACT CTTTTATTAT TCAATAAGCC ACTTCCTATA GCAAATGTTT AACTTTAAAT 60 

40 ATTTTTCGAT GCTAACAAAA AATCACACTA TCATCTTTTA AAATGAAAGT GTGATTACAA 120 
GCAAATCTGT AAAATTTATA AAGCAGAAAC AATTCAACTT TATCATTATG ACATTTCAAT 180 
TAAACCTTCT ACATTATAGT TCCAAGCATC TTACACATGA ATGCAAGTAT TTAACGATTT 24 0 

45 AATTGTGACA TAGCCTGTTG ATATTGTGTT T CATTGAT AT AATTTTGTTG CTTCATTTTT 300 
TCTAAGTTCG TGCTTACACG TTGCGTAAAA TTCTCTGACA TATTATTGAT ATTATATACG 36 0 

CTAGGTGCAT TGACTTTACT AGCTAAAATA GCGCTTTGTA AAACTGTTAT GTGAGACATT 420 

SO 

GTTGTACTAT TTTTATTCAC GGTTGTTCCA AAGTAATGGT TTGCTGCGCC CTCAAGCGTA 4 80 

TATTGATTAT CCCCAAAGTA AATATTATTT AAATAAAAGC TTAAAATTTC GTTCTTATTA 54 0 

55 
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TCATTATCAT AAAAATAATT TTTGACAACT TGTTGTGTAA TGGTACTACC ACCTTGCACA 660 

TCTCTGTCGC TAATCGTTGA AAATAAAGCT CTAGTTGTAC CTTTCAAATC GAATCCATGA 720 

TGATTGTAGA ATCGTTCATC TTCCATTGAA ATAAAGGCAC CTTTAACATA CTCTGGCATG 780 

TTATCAGCTG ACACAAAACT ACTTTTATTT TCAATTTTTC TTAGTTCATC CACATTATCG 84 0 

CGTGTAGATA AAAAATACAT GATACCAATA AACAATGCGA TAATGATTAG AATGGTTAAT 900 

AATATTTTTA ATAGTATTCG TTTACTTTTT TTCTTTTTCG GCGGTTTGCC AACTGGTTGA 960 

TAATACGTAT TATAGTGAGG TTCGTGTTTC ATATGCTCAA AATGTTCATT TGAGTTTGAG 1020 

TACCTATCGC TT C TTTTCAT GCGTTTGCTC CTTCTTTTAA AACTCACTTA GtATATACCT 10 80 

TGaGTTTACC AG TACT AT CA CAAATAGGCT ACACTTTTTG GGAAAATCAG TCCAAGGGCT 114 0 

TACAATCGTA TACGCCATCA TACTTACTTT TTTGTTTTTT GAAAAAATTA TAGATAAATC 12 0 0 

ATTGCAATTT TAAATATTAA TCATGTCAAA TATTGTTATA TTTTATAAAA ATAAAAG AC C 126 0 

ATCCCTATTA AATGCCAATA GAGACGACCT TTTATTTGTT ATTCATTTAT TAAAACTAAA 1320 

ATCCATATTT CATTTCAAAC GAAAATATAT AAATTTTAAC AATCGrTAAC CAC 1373 
(2) INFORMATION FOR SEQ ID NO: 652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652: 

GGCAGATAAT TTAGTCATTG TTGAATCGCC TGCAAAAGCA AnAACCATTG AAAAGTATTT 60 

AGGTAAGAAA TATAAAGTTA TAGCTT CAAT GGGACACGT C AGAGACTTAC CAAGAAGTCA 120 

AATGGGTGTC GACACTGAAG ATAATTACGA ACCAAAATAT ATAACAATAC GCGGAAAAGG 180 

TCCTGTTGTA AAAGAATTGA AAAAACATGC AAAAAAAGCG AAAAACGTCT TTCTCGCAAG 24 0 

TGACCCCGAC CGTGAAGGTG AAGCAATTGC TTGGCATTTA TCAAAAATTT TAGAGCTTGA 300 

AGATTCTAAA GAAAATCGCG TTGTTTTCAA CGAAATAACT AAAGACGCTG TTAAAGAAAG 36 0 

TTTTAAAAAT CCTAGAGAAA TTGAAATGAA CTTAGTCGAT GCACAACAAG CGCGTCGAAT 42 0 

ATTAGATAGA TTAGTTGGCT ATAACATCTC GCCAGTTCTT TGGAAAAAAG TAAAAAAAGG 48 0 

GTTGTCAGCG GGTCGAGTTC AATCTGTTaG CmTTCGTTTA GTCATTGACC GTGAAAATGA 54 0 

nATTCGAAAC TTTAAACCAG AnGAATATTG GACTATTGAA GGAGAATTTA GATACAAAAA 600 
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aAAAGATGTT 


GAGAAAATTA CAGCTGCATT AGATGGAGAT CAATTCGAAA TTACAAACGT 


720 




GACTAAAAAA 


GAAAAAACGC 


GTAATCCAGC AAACCCATTT ACAACTTCTA 


CATTACAACA 


7B0 


5 


AGAGGCGGcA 
ATATGAAGGT 


CGTAAATTAA 
ATAGATTTG 


AcTTTmAAGC AAGAAAAACA ATGATGGTCG 


CACAACAATT 


840 
859 


10 


(2) INFORMATION FOR SEQ ID NO: G53 : 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653: 






20 


TTCAACTTCG 
TTGATAATTT 


TTTGGAAATC 
TTTGTTTCTT 


ATGTTCCTCA ATTGTTGGAC TTAAACGAAT 


TTCTTTAACA 


60 




TTTCATTTCT TTTTCTTTTT TCTGTTGTTC 


GAATTTGAAT 


120 




TTACCGTAAT 


CCATAATTCT 


TGCAACTGGT GGTTTCGCAT TCGGTGCAAC 


GAC CACTAAG 


180 


25 


TCTAAATCTA 


CACGTTCAGC 


CATTTCTAAA GCTTCACGCT TTGATTTAAC 


ACCAATTTGT 


240 




TCACCATCTT 


GACCGATTAA 


ACGTAATTCT TTTGCACGAA TTTTGTCATT 


GATTT GAGTT 


300 


30 


TGATCTTTTG 


CTATGGTTGA 


CACCTCCAAA ATTTTTACGA AATTTGCACC 


AAGCAAAAAG 


360 


GAAGAGCAGG 


TATAAAATAC 


CCGCTCTTCC TTATACACAG TTATGTGTAA 


TGTGATTAAC 


420 




CTGCCAACTG 


CTTTATGCGT 


CGCTACAGGT GAGAAGCGGG TGCTTCTACT 


TGGTTCGTTT 


480 




CGTATTCAAC 


GTTATTAATC 


ATATCAACAA TTCACATTTA AGTCAACACT 


ATAACTGTAA 






TTATTTTTAT 


TTTAACCTTT 


TATTTCATCC ATTGACACGT CTTGACGTAA 


ATCTACTTGT 


600 




TCTAATGGAA 


TTTTTTTCGT 


TTTATATCGA AG CTTATGAT AAATAAAGAA TGCTAAAAAT 


660 


40 


ACTGGAATTC 


CCATATACGT 


AATTAAGAAG CGACTAAAAT TAAAATCTCC 


TGTnTTAATA 


720 




AAGTCAACAT 


CTTGCCCAAn 


AAnTACT 




74 7 



(2) INFORMATION FOR SEQ ID NO: 654: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 54: 

55 
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TTGTGCTTCT TTTTTAGCTT CTTGAACCTC TTGTGCCTCT TGTGATGTAT CACTyAAATT 120 

ATTTGCACTT GCTTCTTCTT TTATCGCTGC TTGTTGTGCT TTCAATGCCA CTGCTTTTGr 180 

TTCTTyATTT GATACAGCCA CACTTTTATC CGCTTCTGCT TGTGCTTCTC TTTTAGCTTC 24 0 

TTGAATCTCT TGTGCTTCTT GTGATGTATC ACTTAAATTA TTTGCACTTG CTTCTTCTTT 300 

TATCGCTGCT TGTTGTGCCT TTAATGC CG C TTGCTCATTT TTAGATTTGT TTAAAAATC C 36 0 

TTCAACACGT TCTTTTGTAT AGGCAACCGT TTCTTCAAGT TGCGTTTTTC TTTCTTCAAA 42 0 

CTTTTGCGAC AGTTCTTGTn CTTTGACTTT nAAATCATCT GCTTTTTGAT AAACTTTATT 480 

TTAAAATACC AACCTAAAGC C 501 
(2) INFORMATION FOR SEQ ID NO: 655: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 55: 

CACCTTGTCA TAATTAATTT TTTGATTTTT CATTTTACTG ATAATAGGTT CAGCATTAAT 6 0 

CATGATTTAA CCTCCCACAT TTAATCATTA ACTTCTATTA TATATGATTC ATATTAAATG 120 

TCAGTCAAAA AAGTTAGAAA TTCATTTTAA TGCATTTATA TTT CGAAAAT CCCTTATGTA 180 

TCTAAAAGCA TTTTTTAAAC TTGAATTTTA AAACACTAAA CAACACATAC GTCTCTGTGT 240 

CATTTTCATT TTTTGTATGT CATATATATG TTTACTTCAT TTAAATCAAT TTCATCTTAT 300 

AATTTAT CGT GTATTTTACA AAAGATTGAC TTCAATTCAT CGTAAAAGTT ATACTTTTGC 360 

CATTTTTTAA TGTAACATGG TGTTAGTAAT AAAAATAATA CATTGAGGTG TTTTACATGA 420 

CAGCATTATT CCCTTATATC GCTTTTGAAA ATTCAAAAGA AGCCCTTGCA TATTACGAAG 480 

AAGTATTTGG TGCAACTGAC GTTAAACGTT TAGAAGTTGG CGAAGAACAA GCGTCACATT 54 0 

TTGGTATGAC TAAGGAAGAA GCGCAAGAAG CAACTATGCA TGCTGAATTT GAAGTGCTTG 600 

GCGTAAAAGT GTTATGTTCT GATTCTTTTG GTCGCGCTGA CAAAATTaAT AATGG CAT AT 66 0 

CATTATTAAT TGATTATGAT GTTAACAATA AGGAAGATGC TGATAAAGTT GAAGCATTCT 720 

ATG AG CAAAT TAAAGATCAT TCTTCAATTG AAATAGAATT AC CGTTTGCT GACCAATTCT 780 

GGGGTGGCAA AATGGGCGT C TTTACCGATA AATACGGTGT TCGTTGGATG 83 0 
(2) INFORMATION FOR SEQ ID NO: 65 6: 
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